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METHODS OF DIAGNOSIS OF BLADDER CANCER, COMPOSITIONS AND 
METHODS OF SCREENING FOR MODULATORS OF BLADDER CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application is related to USSN 60/302,814, filed July 3, 2001; USSN 60/310,099, 

filed August 3, 2001; USSN 60/343,705, filed November 8, 2001; USSN 60/350,666, filed 
November 13, 2001; and USSN 60/372,246, filed April 12, 2001, each of which is 
incorporated herein by reference. 

1 0 FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in bladder 
cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of bladder cancer. The invention further relates to methods for 
1 5 identifying and using agents and/or targets that inhibit bladder cancer. 

BACKGROUND OF THE INVENTION 
In the United States, over 50,000 new cases of bladder cancer are diagnosed annually, 
and more than 10,000 deaths will be attributed to bladder cancer. Bladder cancer is now the 
20 fourth most common cancer among American men and the ninth most common cancer 

among American women. It occurs three times more frequently in men than in women, and it 
occurs roughly twice more frequently in white versus black men. 

Bladder cancer rarely occurs in people younger than 40 years of age, being primarily 
a disease of older men. Nonetheless, bladder cancer is a significant cause of illness and death 
25 in the United States. The risk of bladder cancer increases steeply with age, with over half of 
all bladder cancer deaths occurring after age 70. In white men older than 65, the annual 
disease rate of bladder cancer is approximately 2 cases per 1,000 persons; this contrasts with 
a rate of 0.1 cases per 1,000 persons younger than 65. 
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Within the United States, bladder cancer rates are higher among people who reside in 
northern versus southern states, and is higher for people who live in urban versus rural areas. 
Although this difference suggests that environmental as well as genetic factors may 
contribute to the development and progression of the disease, other studies confirm that 
5 certain genes play a role in bladder cancer.. For example, expression of the tumor suppressor 
gene p53 has been associated with an adverse prognosis for patients with invasive bladder 
cancer. A retrospective study of 243 patients treated by radical cystectomy found that the 
presence of nuclear p53 was an independent predictor for recurrence among patients with mid 
to late stage tumors. Esrig, et al (1994) N.E.J. Med. 331:1259-64. 

10 Urinary bladder cancers represent a spectrum of diseases that can be grouped into 

three general categories: superficial, invasive, and metastatic. The prognosis for treatment is 
highly dependent on the stage at which the tumor is first diagnosed. A unique aspect of 
bladder cancer treatment is that repeated surgical biopsy is an integral part of routine patient 
management. This has permitted the conduct of molecular genetic studies of tumors from 

1 5 specific stages of the disease. The results of these studies suggest that bladder cancers 

develop and progress along at least two discrete pathways, which may account for differences 
in invasiveness and metastatic" potential. Incorporating molecular genetic factors into the 
current paradigm for diagnosis and treatment will optimize the probability of cure and allow 
the quality of life for bladder cancer patients to be maintained. 

20 Early detection and treatment can prevent reoccurrence and progression of the disease 

to an incurable stage. Thus, the identification of novel diagnostic markers and therapeutic 
. targets will improve the current treatment of bladder cancer patients. While industry and 
academia have identified novel sequences, there has not been an equal effort exerted to 
identify the function of these novel sequences in disease states. The elucidation of a role for 

25 novel proteins and compounds in disease states for identification of diagnostic markers and 
therapeutic targets is essential for improving the current treatment of bladder cancer patients. 
Accordingly, provided herein are methods that can be used in diagnosis and prognosis of 
bladder cancer. Additionally, provided herein are molecular targets for therapeutic 
intervention in bladder cancer and other related bladder diseases.. Further provided are 
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methods that can be used to screen candidate bioactive agents for the ability to modulate 
bladder cancer. 

SUMMARY OF THE INVENTION 
5 The present invention therefore provides nucleotide sequences of genes that are up- 

and down-regulated in bladder cancer cells. Such genes are useful for diagnostic purposes, 
and also as targets for screening for therapeutic compounds that modulate bladder cancer, 
such as hormones or antibodies. Other aspects of the invention will become apparent to the 
skilled artisan by the following description of the invention. 
1 0 In one aspect, the present invention provides a method of detecting a bladder cancer- 

associated transcript in a cell from a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1A-13. 

In one embodiment, the present invention provides a method of determining the level 
15 of a bladder cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a bladder 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1 A-13. 
20 In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 

95% identical to a sequence as shown in Tables 1 A-13. 

In one embodiment, the biological sample is a tissue sample. In another embodiment, 
the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. 
25 In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat bladder 
cancer. In another embodiment, the patient is suspected of having metastatic bladder cancer. 
In one embodiment, the patient is a human. 

In one embodiment, the bladder cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of bladder cancer, the method comprising the steps of: (i) providing 
5 a biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a bladder cancer-associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 
identical to a sequence as shown in Tables 1A-13, thereby monitoring the efficacy of the 
therapy. In a further embodiment, the patient has metastatic bladder cancer. In a further 
10 embodiment, the patient has a drug resistant form of bladder cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the bladder cancer-associated transcript to a level of the bladder cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

1 5 Additionally, provided herein is a method of evaluating the effect of a candidate 

bladder cancer drug comprising administering the drug to a patient and removing a cell 
sample from the patient. The expression profile of the cell is then determined. This method 
may further comprise comparing the expression profile to an expression profile of a healthy 
individual. In a preferred embodiment, said expression profile includes a gene of Tables 1 A- 

20 13. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1A-13. 

In one embodiment, an expression vector or cell comprises the isolated nucleic acid. 
In one aspect, the present invention provides an isolated polypeptide which is encoded 
25 by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1 A-13. 

In another aspect, the present invention provides an antibody that specifically binds to 
an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables 1 A-13. 

In one embodiment, the antibody is conjugated to an effector component, e.g., a 
30 fluorescent label, a radioisotope or a cytotoxic chemical. 
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In one embodiment, the antibody is an antibody fragment. In another embodiment, 
the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a bladder cancer 
cell in a biological sample from a patient, the method comprising contacting the biological 
5 sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting antibodies 
specific to bladder cancer in a patient, the method comprising contacting a biological sample 
from the patient with a polypeptide encoded by a nucleic acid comprising a sequence from 
Tables 1A-13. 

10 In another aspect, the present invention provides a method for identifying a compound 

that modulates a bladder cancer-associated polypeptide, the method comprising the steps of: 
(i) contacting the compound with a bladder cancer-associated polypeptide, the polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A-13; and (ii) determining the functional effect of the 

1 5 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or cell 
membrane. In another embodiment, the polypeptide is recombinant. 
20 In one embodiment, the functional effect is determined by measuring ligand binding 

to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting proliferation 
of a bladder cancer-associated cell to treat bladder cancer in a patient, the method comprising 
the step of administering to the subject a therapeutically effective amount of a compound 
25 identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having bladder cancer or to a 
cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
30 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

5 
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as shown in Tables 1 A-13in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
the level of expression of the polynucleotide is a candidate for the treatment of bladder 
cancer. 

5 In one embodiment, the control is a mammal with bladder cancer or a cell sample 

therefrom that has not been treated with the test compound. In another embodiment, the 
control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
10 periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1A-13 are 
individually compared to their respective levels in a control cell sample or mammal. In a 
15 preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a mammal 
having bladder cancer comprising administering a compound identified by the assay 
described herein. 

In another aspect, the present invention provides a pharmaceutical composition for 
20 treating a mammal having bladder cancer, the composition comprising a compound identified 
by the assay described herein and a physiologically acceptable excipient. 

In one aspect, the present invention provides a method of screening drug candidates 
by providing a cell expressing a gene that is up- and down-regulated as in a bladder cancer. 
In one embodiment, a gene is selected from Tables 1A-13. The method further includes 
25 adding a drug candidate to the cell and determining the effect of the drug candidate on the 
expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes comparing the 
level of expression in the absence of the drug candidate to the level of expression in the 
presence of the drug candidate, wherein the concentration of the drug candidate can vary 
30 when present, and wherein the comparison can occur after addition or removal of the drug 
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candidate. In a preferred embodiment, the cell expresses at least two expression profile 
genes. The profile genes may show an increase or decrease. 

Also provided is a method of evaluating the effect of a candidate bladder cancer drug 
comprising administering the drug to a transgenic animal expressing or over-expressing the 
5 bladder cancer modulatory protein, or an animal lacking the bladder cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid segments 
of Tables 1 A- 13, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
10 nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with bladder cancer is 
provided. The method comprises determining the expression of a gene of Tables 1 A- 13 in a 
first tissue type of a first individual, and comparing the distribution to the expression of the 
gene from a second normal tissue type from the first individual or a second unaffected 
1 5 individual. A difference in the expression indicates that the first individual has a disorder 
associated with bladder cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence of a 
gene that is not up- and down-regulated in bladder cancer. 

In one embodiment a method for screening for a bioactive agent capable of interfering 
20 with the binding of a bladder cancer modulating protein (bladder cancer modulatory protein) 
or a fragment thereof and an antibody which binds to said bladder cancer modulatory protein 
or fragment thereof. In a preferred embodiment, the method comprises combining a bladder 
cancer modulatory protein or fragment thereof, a candidate bioactive agent and an antibody 
which binds to said bladder cancer modulatory protein or fragment thereof. The method 
25 further includes determining the binding of said bladder cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 
Preferably, the agent inhibits bladder cancer. 

Also provided herein are methods of eliciting an immune response in an individual. 
30 In one embodiment a method provided herein comprises administering to an individual a 
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composition comprising a bladder cancer modulating protein, or a fragment thereof. In 
another embodiment, the protein is encoded by a nucleic acid selected from those of Tables 
1A-13. 

Further provided herein are compositions capable of eliciting an immune response in 
5 an individual. In one embodiment, a composition provided herein comprises a bladder cancer 
modulating protein, preferably encoded by a nucleic acid of Tables 1 A- 13 or a fragment 
thereof, and a pharmaceutically acceptable carrier. In another embodiment, said composition 
comprises a nucleic acid comprising a sequence encoding a bladder cancer modulating 
protein, preferably selected from the nucleic acids of Tables 1A-13, and a pharmaceutically 

10 acceptable carrier. 

Also provided are methods of neutralizing the effect of a bladder cancer protein, or a 
fragment thereof, comprising contacting an agent specific for said protein with said protein in 
an amount sufficient to effect neutralization. In another embodiment, the protein is encoded 
by a nucleic acid selected from those of Tables 1A-13. 

1 5 In another aspect of the invention, a method of treating an individual for bladder 

cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a bladder cancer modulating protein. In another embodiment, the 
method comprises administering to a patient having bladder cancer an antibody to a bladder 
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 

20 be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for bladder disease (BD), e.g., cancer, 
25 including metastatic bladder cancer, as well as methods for screening for compositions which 
modulate bladder diseases. Also provided are methods and compositions for treating bladder 
disease. Various related conditions where these markers may be useful also, include, e.g., 
carcinoma in situ, various stages of papillary carcinomas; and such conditions in different 
stages, layers, structural portions, etc. 
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Recent advances in molecular medicine, generally, have increased the interest in 
tumor-specific cell surface antigens that could serve as diagnostic or prognostic markers, or 
as targets for various immunotherapeutic or small molecule strategies. Antigens suitable for 
immunotherapeutic strategies should be highly expressed in cancer tissues and ideally not 
5 expressed in other, e.g., normal, adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated, as a physiological consequence of such expression would be 
limited. Examples of such antigens in cancers other than bladder cancer include Her2/neu 
and the B-cell antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer. 
10 Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal 
antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgkin's lymphoma. 
Maloney, et al. (1997) Blood 90:2188-2195; and Leget and Czuczman (1998) Curr. Opin. 
Oncol. 10:548-551. 

15 Definitions 

The term "bladder cancer protein" or "bladder cancer polynucleotide" or ,r bladder 
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 

20 preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 

sequence identity, preferably over a region of over a region of at least about 25, 50, 1 00, 200, 
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 
Tables 1A-13; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an 
immunogen comprising an amino acid sequence encoded by a nucleotide sequence of or 

25 associated with a gene of Tables 1 A-13, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables 1A-13 and conservatively modified variants thereof; or (4) 
have an amino acid sequence that has greater than about 60% amino acid sequence identity, 
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 

30 99% or greater amino sequence identity, preferably over a region of over a region of at least 
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about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded 
by a nucleotide sequence of or associated with a gene of Tables 1A-13. A polynucleotide or 
polypeptide sequence is typically from a mammal including, but not limited to, primate, e.g., 
human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A 
5 "bladder cancer polypeptide" and a ,r bladder cancer polynucleotide," include both naturally 
occurring or recombinant forms. 

A "full length" bladder cancer protein or nucleic acid refers to a bladder cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type bladder cancer 

10 polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various 
stages of splicing, including alternative splicing, or post-translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid, e.g., that 
contains nucleic acids or polypeptides of a bladder cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 

15 humans, or rodents, e.g., mice and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, urine, tears, mucus, hair, skin, etc. Biological samples 
also include explants and primary and/or transformed cell cultures derived from patient 
tissues. A biological sample is typically obtained from a eukaryotic organism, most 

20 preferably a mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, 
e.g., guinea pig, rat, or mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 

25 isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids 
or polypeptide sequences, refer to two or more sequences or subsequences that are the same 

30 or have a specified percentage of amino acid residues or nucleotides that are the same (e.g., 

10 
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about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned 
for maximum correspondence over a comparison window or designated region) as measured 
using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters 
5 described below, or by manual alignment and visual inspection (see, e.g., NCBI web site 
http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to be 
"substantially identical." This definition also refers to, or may be applied to, the compliment 
of a test sequence. The definition also includes sequences that have deletions and/or 
additions, substitutions, naturally occurring variants, e.g., polymorphic or allelic, and man- 

1 0 made variants. As described below, the preferred algorithms can account for gaps and the 
like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, to 

15 which test sequences are compared. When using a sequence comparison computer algorithm, 
test and reference sequences, subsequence coordinates, and sequence algorithm program 
parameters are typically designated. Default or alternative program parameters can be 
selected. The sequence comparison algorithm then calculates the percent sequence identities 
for the test sequences relative to the reference sequence, based on the program parameters. 

20 A "comparison window", as used herein, includes reference to a segment of one of the 

number of contiguous positions selected from the group consisting typically of from about 
20-600, usually about 50-200, more usually about 100-150 in which a sequence may be 
compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 

25 well-known in the art. Optimal alignment of sequences for comparison can be conducted, by, 
e.g., the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, 
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443- 
453, the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l Acad. Sci. 
USA 85:2444-448, computerized implementations of these algorithms (GAP, BESTFIT, 

30 FAST A, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 



WO 03/003906 



PCT/US02/21338 



Group, 575 Science Dr., Madison, WI), or manual alignment and visual inspection (see, e.g., 
Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology 
Lippincott. 

Preferred algorithms suitable for determining percent sequence identity and sequence 
5 similarity include the BLAST and BLAST 2.0 algorithms. See Altschul, et al. (1977) Nuc, 
Acids Res. 25:3389-3402; and Altschul, et al. (1990) J. Mol. Biol. 215:403-410. BLAST and 
BLAST 2.0 are used, with the parameters described herein, to determine percent sequence 
identity for the nucleic acids and proteins of the invention. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 

10 (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive-valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul, et al, supra). These initial neighborhood word hits act as seeds for 

15 initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, e.g., for nucleotide sequences, the parameters M 
(reward score for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 

20 calculate the cumulative score. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, f , and X determine the sensitivity and speed of the alignment. The BLASTN 

25 program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an expectation 
(E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl Acad. Sci. USA 
89:10915-919) alignments (B) of 50, expectation (E) of 10, M=5, N— 4, and a comparison of 

30 both strands. 



WO 03/003906 



PCT/US02/21338 



The BLAST algorithm also performs a statistical analysis of the similarity between 
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
5 two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01 , and most preferably less than about 0.001 . Log values may be large 
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc. 

10 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

15 indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

20 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

25 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

30 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
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some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments typically denotes that a 
nucleic acid or protein gives rise to essentially one band in an electrophoretic gel Preferably, 
it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95% 
5 pure, and most preferably at least 99% pure. "Purify" or "purification" in other embodiments 
means removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 

10 at least one amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and a non-naturally occurring amino acid polymer. 

The term "amino acid" embraces naturally occurring or synthetic amino acids, amino 
acid analogs, and amino acid mimetics. Naturally occurring amino acids are those encoded 

1 5 by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 
y-carboxyglutamate, and O-phosphoserine. Amino acid analogs include compounds that 
share a basic chemical structure with a naturally occurring amino acid, e.g., an a carbon that 
is bound to a hydrogen, a carboxyl group, an amino group, or an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

20 modified R groups (e.g., norleucine) or modified peptide backbones, but share some basic 
chemical structure with a naturally occurring amino acid. Amino acid mimetics include 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by their commonly known three letter symbols 

25 or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature 
Commission. Nucleotides, likewise, may be referred to by their commonly accepted single- 
letter codes. 

"Conservatively modified variants" applies to amino acid or nucleic acid sequences. 
With respect to particular nucleic acid sequences, conservatively modified variants refers to 
30 those nucleic acids which encode identical or essentially identical amino acid sequences, or 
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where the nucleic acid does not encode an amino acid sequence, to essentially identical or 
associated, e.g., naturally contiguous, sequences. Because of the degeneracy of the genetic 
code, a large number of functionally identical nucleic acids encode most proteins. For 
instance, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Thus, at 
5 each position where an alanine is specified by a codon, the codon can be altered to another of 
the corresponding codons described without altering the encoded polypeptide. Such nucleic 
acid variations are "silent variations," which are one species of conservatively modified 
variations. Each nucleic acid sequence herein which encodes a polypeptide also describes 
silent variations of the nucleic acid. One of skill will recognize that in certain contexts each 

10 codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 
functionally identical molecule. Accordingly, often silent variations of a nucleic acid which 
encodes a polypeptide is implicit in a described sequence with respect to the expression 
product, but not with respect to actual probe sequences. 

15 As to amino acid sequences, one of skill will recognize that individual substitutions, 

deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 
alters, adds, or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration results in the 
substitution of an amino acid with a chemically similar amino acid. Conservative substitution 

20 tables providing functionally similar amino acids are well known in the art. Such 

conservatively modified variants are in addition to and do not exclude polymorphic variants, 
interspecies homologs, and alleles of the invention. Typically conservative substitutions for 
one another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) 
Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), 

25 Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine 
(S), Threonine (T); and 8) Cysteine (C), Methionine (M). See, e.g., Creighton (1984) 
Proteins: Structure and Molecular Properties Freeman. 

Macromolecular structures such as polypeptide structures can be described in terms of 
various levels of organization. See, e.g., Alberts, et al. (eds. 2001) Molecular Biology of the 

30 Cell (4th ed.) Garland; and Cantor and Schimmel (1980) Biophysical Chemistry Part I: The 
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Conformation of Biological Macromolecules Freeman. "Primary structure" refers to the 
amino acid sequence of a particular peptide. "Secondary structure" refers to locally ordered, 
three dimensional structures within a polypeptide. These structures are commonly known as 
domains, which are portions of a polypeptide that often form a compact unit of the 
5 polypeptide, and are typically about 25-500 amino acids long. Typical domains are made of 
sections of lesser organization such as stretches of P-sheet and a-helices. "Tertiary structure" 
refers to the complete three dimensional structure of a polypeptide monomer. "Quaternary 
structure" refers to the three dimensional structure formed, usually by the nbncovalent 
association of independent tertiary units. Anisotropic terms are also known as energy terms. 

10 "Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 

used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers, 
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A 

1 5 nucleic acid of the present invention will generally contain phosphodiester bonds. In some 
cases, nucleic acid analogs are included that may have alternate backbones, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49: 1925-963 and references therein; 
Letsinger (1970) J. Org. Chem. 35:3800-803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579- 
589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett. 

20 805; Letsinger, et al. (1988) J. Am. Chem. Soc. 1 10:4470-471; and Pauwels, et al. (1986) 
Chemica Scripta 26:141-149); phosphorothioate (Mag, et al. (1991), Nucleic Acids Res. 
19:1437-441; and U.S. Patent No. 5,644,048); phosphorodithioate (Brill, et al. (1989) J. Am. 
Chem. Soc. 111:2321-322); O-methylphophoroamidite linkages (see Eckstein (1992) 
Oligonucleotides and Analogues: A Practical Approach Oxford Univ. Press); and peptide 

25 nucleic acid backbones and linkages (see Egholm (1 992) J. Am. Chem. Soc. 114:1 895-897; 
Meier, et al. (1992) Chem. hit. Ed. Engl. 31:1008-010; Nielsen (1993) Nature 365:566-568; 
Carlsson, et al. (1996) Nature 380:207. Other analog nucleic acids include those with 
positively charged backbones(Denpcy, et al. (1995) Proc. Natl Acad. Sci. USA 92:6097- 
101); non-ionic backbones (U.S. Patent Nos. 5,386,023; 5,637,684; 5,602,240; 5,216,141; 

30 and 4,469,863; Kiedrowshi, et al. (1991) Aneew. Chem. Intl. Ed. English 30:423-426; 
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Letsinger, et al. (1988) J. Am. Chem. Soc, 1 10:4470-471; Jung, et al. (1994) Nucleoside and 
Nucleotide 13:1597-xxx; Chapters 2-3 of Sanghvi and Cook (eds. 1994) Carbohydrate 
Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et al. (1994) 
Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular NMR 
5 34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743-xxx); and non-ribose backbones (see U.S. 
Patent Nos. 5,235,033 and 5,034,506, and Chapters 6-7 of Sanghvi and Cook (eds. 1994) 
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580. Nucleic 
acids containing one or more carbocyclic sugars are also contemplated. See Jenkins and 
Turner (1995) Chem. Soc. Rev. 24:169-176. Several nucleic acid analogs are described in 

10 Rawls (page 35, June 2, 1997) C&E News . Modifications of the ribose-phosphate backbone 
may be made, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures 
of naturally occurring nucleic acids and analogs may be made. 

15 Particularly preferred are peptide nucleic acids (PNA) which include peptide nucleic 

acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
The PNA backbone typically exhibits improved hybridization kinetics, exhibiting larger 
changes in the melting temperature (Tm) for mismatched versus perfectly matched basepairs, 

20 DNA and RNA typically exhibit a 2-4° C drop in T m for an internal mismatch. With the 
non-ionic PNA backbone, the drop is closer to 7-9° C. And due to their non-ionic nature, 
hybridization of the polymers is relatively insensitive to salt concentration. In addition, 
PNAs are not as easily degraded by cellular enzymes, and can be more stable. 

The nucleic acids may be single stranded or double stranded, or contain portions of 

25 both double stranded or single stranded sequence. As will be appreciated by those in the art, 

the depiction of a single strand also defines the sequence of the complementary strand; thus 

the sequences described herein also provide the complement of the sequence. The nucleic 

acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may 

contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases, 

30 including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, 
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isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally occurring RNA, e.g., 
a pre-mRNA, hnRNA, or mRNA. As used herein, the term "nucleoside" includes nucleotides 
and nucleoside and nucleotide analogs, and modified nucleosides such as amino modified 
nucleosides. In addition, "nucleoside" includes non-naturally occurring analog structures. 
5 Thus, e.g., the individual units of a peptide nucleic acid, each containing a base, are referred 
to herein as a nucleoside. 

A "label" or "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, chemical, or other physical means. Direct or 
indirect methods are comtemplated. For example, useful labels include 32p 9 fluorescent 

10 dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, 

digoxigenin, or haptens and proteins or other entities which can be made detectable, e.g., by 
incorporating a radiolabel into the peptide or used to detect antibodies specifically reactive 
with the peptide. The labels may be incorporated into the bladder cancer nucleic acids, 
proteins, and antibodies. Methods are well known for conjugating the antibody to the label, 

1 5 including those methods described by Hunter, et al. (1962) Nature 144:945-946; David, et al. 
(1974) Biochemistry 13:1014-021; Pain, et al. Q98n J. Immunol. Meth. 40:219-230; and 
Nygren (1982) J. Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is bound 
(or linked, or conjugated), either covalently, through a linker or a chemical bond, or 

20 noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to a target, 
e.g., an antibody. The "effector" can be a variety of molecules including, e.g., detection 
moieties including radioactive compounds; fluorescent compounds; an enzyme or substrate; 
tags such as epitope tags; a toxin; activatable moieties; a chemotherapeutic agent; a lipase; an 
antibiotic; a radioisotope emitting "hard", e.g., beta radiation; or an attracting moiety. 

25 A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 

30 partners binds to the other, e.g., biotin, streptavidin. 
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As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid 
capable of binding to a target nucleic acid of complementary sequence through one or more 
types of chemical bonds, usually through complementary base pairing, usually through 
hydrogen bond formation. As used herein, a probe may include natural (e.g., A, G, C, or T) 
5 or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be 
joined by a linkage other than a phosphodiester bond, so long as it does not functionally 
interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. Probes 
may bind target sequences lacking complete complementarity with the probe sequence 

10 depending upon the stringency of the hybridization conditions. The probes are preferably 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which streptavidin linked label may bind. By assaying for the 
presence or absence of the probe, one can detect the presence or absence of the select 
sequence or subsequence. Diagnosis or prognosis may be based at the genomic level, or at 

1 5 the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant 

20 cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed, or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 

25 operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, e.g., using in vivo cellular machinery of the 

30 host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
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recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, e.g., through the expression of a recombinant nucleic 
acid as depicted above. 

5 The tenn "heterologous" when used with reference to portions of a nucleic acid 

indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
10 coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 

15 sequences near the start site of transcription, such as, in the case of a polymerase II type 

promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

20 active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

25 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under which a 

probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be 
different in different circumstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in "Overview 

10 of principles of hybridization and the strategy of nucleic acid assays" in Tijssen (1993) 

Hybridization with Nucleic Probes (Techniques in Biochemistry and Molecular Biology; vol. 
24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The T m 
is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 

1 5 50% of the probes complementary to the target hybridize to the target sequence at 

equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 

20 about 10-50 nucleotides) and at least about 60° C for long probes (e.g., greater than about 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is at least 
about two times background, preferably about 10 times background hybridization. 
Exemplary stringent hybridization conditions can be as following: 50% formamide, 5x SSC, 

25 and 1% SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 
0.2x SSC, and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low 
stringency amplification, although annealing temperatures may vary between about 32-48° C 
depending on primer length. For high stringency PGR amplification, a temperature of about 
62° C is typical, although high stringency annealing temperatures can range from about 50- 
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65° C, depending on the primer length and specificity. Typical cycle conditions for both high 
and low stringency amplifications include a denaturation phase of 90-95° C for 30-120 sec, 
an annealing phase lasting 30-120 sec, and an extension phase of about 72° C for 1-2 min. 
Protocols and guidelines for low and high stringency amplification reactions are provided, 
5 e.g., in Innis, et al. (1990) PCR Protocols, A Guide to Methods and Applications Academic 
Press NY. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 

1 0 permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least about 
twice background. Alternative hybridization and wash conditions can be utilized to provide 

1 5 conditions of similar stringency. Additional guidelines for determining hybridization 

parameters are provided in numerous references, e.g.,. Ausubel, et al. Current Protocols in 
Molecular Biology Lippincott. 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a bladder cancer protein includes the determination of a parameter that is 

20 indirectly or directly under the influence of the bladder cancer protein or nucleic acid, e.g., a 
functional, physical, or chemical effect, such as the ability to decrease bladder cancer. It 
includes Hgand binding activity; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cell viability, cellular proliferation; cellular 
transformation; growth factor or serum dependence; tumor specific marker levels; 

25 invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 

expression in cells undergoing metastasis, and other characteristics of bladder cancer cells. 
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a 

30 bladder cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. 
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Such functional effects can be measured by many means known to those skilled in the art, 
e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive 
index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the bladder cancer protein; 
5 measuring binding activity or binding assays, e.g., binding to antibodies or other ligands, and 
measuring cellular proliferation or metabolism. Determination of the functional effect of a 
compound on bladder cancer can also be performed using bladder cancer assays, such as, in 
vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact inhibition and 
density limitation of growth; cellular proliferation; cellular transformation; growth factor or 

10 serum dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth 
and metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and 
other characteristics of bladder cancer cells. Functional effects can be evaluated by many 
means, e.g., microscopy for quantitative or qualitative measures of alterations in 
morphological features, measurement of changes in RNA or protein levels for bladder 

1 5 cancer-associated sequences, measurement of RNA stability, identification of downstream or 
reporter gene expression (CAT, Iuciferase, p-gal, GFP, and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of bladder cancer polynucleotide and 

20 polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 
compounds identified using in vitro and in vivo assays of bladder cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of bladder cancer proteins, e.g., antagonists. Antisense nucleic acids 

25 may seem to inhibit expression and subsequent function of the protein. "Activators" are 

compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate bladder cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of bladder cancer proteins, e.g., versions with altered activity, 
as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 

30 chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
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expressing the bladder cancer protein in vitro, in cells, or cell membranes, applying putative 
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of bladder cancer can also be identified by incubating 
bladder cancer cells with the test compound and determining increases or decreases in the 
5 expression of 1 or more bladder cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or 
more bladder cancer proteins, such as bladder cancer proteins encoded by the sequences set 
out in Tables 1A-13. 

Samples or assays comprising bladder cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 

1 0 activator, or modulator to examine the extent of inhibition. Control samples (untreated with 
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably about 
50%, more preferably about 25-0%. Activation of a bladder cancer polypeptide is achieved 
when the activity value relative to the control (untreated with activators) is about 1 10%, more 

15 preferably about 150%, more preferably about 200-500% (e.g., two to five fold higher 
relative to the control), more preferably about 1000-3000% higher. 

The phrase "changes in cell growth" refers to a change in cell growth and proliferation 
characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 

20 limitation of growth, loss of growth factor or serum requirements, changes in cell 

morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., pp. 231-241 of Freshney (1994) Culture of Animal 
Cells: A Manual of Basic Technique (3d ed.). 

25 "Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

30 spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
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Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy. See, Freshney 
(2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss. 
"Antibody 11 refers to a polypeptide comprising a framework region from an 
5 immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 

10 IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul 
(ed. 1999) Fundamental Immunology (4th ed.) Raven. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one 

15 "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 

chain defines a variable region of about 100 to 1 10 or more amino acids primarily responsible 
for antigen recognition. The terms variable light chain (VjJ and variable heavy chain (Vfj) 

refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 

20 fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 

antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab 
which itself is a light chain joined to Vr-ChI by a disulfide bond. The F(ab)'2 may be 
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 
converting the F(ab) f 2 dimer into an Fab 1 monomer. The Fab 1 monomer is essentially Fab 

25 with part of the hinge region. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven. 
While various antibody fragments are defined in terms of the digestion of an intact antibody, 
such fragments may be synthesized de novo either chemically or by using recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies, or those synthesized de novo using 
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recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage 
display libraries. See, e.g., McCafferty, et al. (1990) Nature 348:552-554. 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many techniques can be used. See, e.g., Kohler and Milstein (1975) Nature 
5 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. pp. 77-96 in 
vReisfeld and Sell f 1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991) 
Current Protocols in Immunology Lippincott; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and 
Practice (2d ed.) Academic Press. Techniques for the production of single chain antibodies 

10 (U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this 

invention. Also, transgenic mice, or other organisms such as other mammals, may be used to 
express humanized antibodies. Alternatively, phage display technology can be used to 
identify antibodies and heteromeric Fab fragments that specifically bind to selected antigens. 
See, e.g., McCafferty, et al. (1990) Nature 348:552-554; and Marks, et al. (1992) 

15 Biotechnology 10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, effector function, and/or 
species, or an entirely different molecule which confers new properties to the chimeric 

20 antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 

region, or a portion thereof, is altered, replaced, or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of bladder cancer-associated sequences 

25 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have a particular gene similarly expressed, the evaluation of a 
number of genes simultaneously allows the generation of a gene expression profile that is 

30 characteristic of the state of the cell. That is, normal tissue (e.g., normal bladder or other 
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tissue) may be distinguished from cancerous or metastatic cancerous tissue of the bladder, or 
bladder cancer tissue or metastatic bladder cancerous tissue can be compared with tissue 
samples of bladder and other tissues from surviving cancer patients. By comparing 
expression profiles of tissue in known different bladder cancer states, information regarding 
5 which genes are important (including both up- and down-regulation of genes) in each of these 
states is obtained. 

The identification of sequences that are differentially expressed in bladder cancer 
versus non-bladder cancer tissue allows the use of this information in a number of ways. For 
example, a particular treatment regime may be evaluated: does a chemotherapeutic drug act 

10 to down-regulate bladder cancer, and thus tumor growth or recurrence, in a particular patient; 
or does chemotherapy or radiation therapy induce expression of particular targets. Similarly, 
diagnosis and treatment outcomes may be done or confirmed by comparing patient samples 
with the known expression profiles. Metastatic tissue can also be analyzed to determine the 
stage of bladder cancer in the tissue or origin of a primary tumor. Furthermore, these gene 

15 expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the bladder cancer expression profile. This may be done by making biochips 
comprising sets of important bladder cancer genes, which can then be used in these screens. 
These methods can also be applied on the protein basis; that is, protein expression levels of 

20 the bladder cancer proteins can be evaluated for diagnostic purposes or to screen candidate 
agents. In addition, the bladder cancer nucleic acid sequences can be administered for gene 
therapy purposes, including the administration of antisense or inhibitory nucleic acids, or the 
bladder cancer proteins (including antibodies and other modulators thereof) administered as 
therapeutic drugs. 

25 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in bladder disease or cancer relative to normal tissues and/or non- 
malignant bladder tissue, herein termed "bladder cancer sequences." As outlined below, 
bladder cancer sequences include those that are up-regulated (e.g., expressed at a higher 
level) in bladder cancer, as well as those that are down-regulated (e.g., expressed at a lower 

30 level). In a preferred embodiment, the bladder cancer sequences are from humans; however, 
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as will be appreciated by those in the art, bladder cancer sequences from other organisms may 
be useful in animal models of disease and drug evaluation; thus, other bladder cancer 
sequences are provided, from vertebrates, including mammals, including rodents (rats, mice, 
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 
5 horses, etc.) and pets, e.g., (dogs, cats, etc.). Bladder cancer sequences from other organisms 
may be obtained using the techniques outlined below. 

Bladder cancer sequences can include both nucleic acid and amino acid sequences. 
Bladder cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids, as well as 

1 0 screening applications. Biochips comprising nucleic acid probes or PCR microtiter plates 
with selected probes to the bladder cancer sequences can be generated. 

A bladder cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the bladder cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 

1 5 generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying bladder cancer-associated sequences, the bladder cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal, non- 
malignant, or cancerous tissues, or tumor tissue samples from patients who have metastatic 

20 disease vs. non metastatic tissue. Other suitable tissue comparisons include comparing 
bladder cancer samples with metastatic cancer samples from other cancers, such as lung, 
bladder, gastrointestinal cancers, ovarian, etc. Samples of different stages of bladder cancer, 
e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, 

25 and treated for the preparation of mRNA. Suitable biochips are commercially available, e.g., 
from Affymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
30 bladder, but also including, and not limited to lung, heart, brain, liver, bladder, kidney, 
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muscle, colon, small intestine, large intestine, spleen, bone, and placenta. In a preferred 
embodiment, those genes identified during the bladder cancer screen that are expressed in a 
significant amount in other tissues are removed from the profile, although in some 
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
5 that the target be disease specific, e.g., not be expressed on critical organs. 

In a preferred embodiment, bladder cancer sequences are those that are up-regulated 
in bladder cancer; that is, the expression of these genes is higher in the bladder cancer tissue 
as compared to non-cancerous tissue. "Up-regulation" as used herein often means at least 
about a two-fold change, preferably at least about a three fold change, with at least about 

1 0 five-fold or higher being preferred. Unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and sequences of accession numbers 
are hereby expressly incorporated by reference. GenBank is known in the art, see, e.g., 
Benson, et al. (1998) Nuc. Acids Res. 26:1-7 and http://www.ncbi.nlm.nih.gov/. Sequences 
are also available in other databases, e.g., European Molecular Biology Laboratory (EMBL) 

1 5 and DNA Database of Japan (DDBJ). 

In another preferred embodiment, bladder cancer sequences are those that are down- 
regulated in the bladder cancer; that is, the expression of these genes is lower in bladder 
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1 A-13). "Down- 
regulation" as used herein often means at least about a two-fold change, preferably at least 

20 about a three fold change, with at least about five- fold or higher being preferred. 

Informatics 

The ability to identify genes that are over or under expressed in bladder cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
25 of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with bladder cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation. See Anderson (June 11-12, 1998) Pharmaceutical Proteomics: 
30 Targets, Mechanism, and Function, paper presented at the IBC Proteomics conference, 
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Coronado, CA. Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
5 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that includes 
at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 

1 0 database of the invention can be maintained on an electronic device allowing for the storage 
of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent that similar databases can be assembled for 

1 5 assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample undergoing bladder cancer, e.g., the identification of bladder cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 

20 with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

25 An array of methods for indexing and retrieving biomolecular information is known 

in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 
Patent 5,953,727 discloses a relational database having sequence records containing 

30 information in a format that allows a collection of partial-length DNA sequences to be 
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catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
5 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

1 0 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

15 the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics CSH Press, NY; Durbin, et al. (eds. 
1999) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids 
Cambridge Univ. Press; Baxevanis and Oeullette (eds. 1998) Bioinformatics: A Practical 
Guide to the Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler 

20 (1999) Bioinformatics: Basic Applications in Biological Science and Medicine CRC Press; 
Setubal, et al. (eds 1997) Introduction to Computational Molecular Biology Brooks/Cole; 
Misener and Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Oxford Univ. 
Press; Higgins and Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A 
Practical Approach Oxford Univ. Press; Brown (2001) Bioinformatics: A Biologist's Guide to 

25 Biocomputing and the Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts 
and Techniques Kaufinann Pub.; and Waterman (1995) Introduction to Computational 
Biology: Maps. Sequences, and Genomes Chap and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
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with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 
5 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for bladder cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

1 0 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

15 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

20 encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

25 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 
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The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 
5 in a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 

10 least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 

15 ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 

20 obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for alignment and/or comparison of the assay results. 
Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 

25 file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 

30 the query target and results are output via an I/O device. For example, a central processor 

33 



WO 03/003906 



PCT/US02/21338 



can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 
5 SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 
device. 

The invention also preferably provides the use of a computer system, such as that 
10 described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 

collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

15 

Characteristics of bladder cancer-associated proteins 

Bladder cancer proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the bladder 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 

20 cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such 
proteins often results in unregulated or disregulated cellular processes (see, e.g., Alberts, et 
al. (1994) Molecular Biology of the Cell (3d ed.) Garland. For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

25 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 

30 proteins of one or more structural motifs for which defined functions have been attributed. In 
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addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
5 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 

10 sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 

15 Center in England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000) 
Nuc. Acids Res . 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 
322. 

In another embodiment, the bladder cancer sequences are transmembrane proteins. 

20 Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 

25 domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. 
30 For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
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and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors such as G protein coupled 
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain 
5 7 membrane spanning regions. Characteristics of transmembrane domains include 

approximately 17 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
localization and number of transmembrane domains within the protein may be predicted (see, 
e.g., PSORT web site http://psort.nibb.ac.jp/). Important transmembrane protein receptors 

10 include, but are not limited to the insulin receptor, insulin-like growth factor receptor, human 
growth hormone receptor, glucose transporters, transferrin receptor, epidermal growth factor 
receptor, low density lipoprotein receptor, epidermal growth factor receptor, leptin receptor, 
and interleukin receptors, e.g., IL-1 receptor, EL-2 receptor, etc. 

The extracellular domains of transmembrane proteins are diverse; however, conserved 

1 5 motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or functions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains are 
found on receptors. Factors that bind the receptor domain include circulating ligands, which 
may be peptides, proteins, or small molecules such as adenosine and the like. For example, 

20 growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, neurotrophic factors and the like. Extracellular domains also bind to cell- 
associated molecules. In this respect, they mediate cell-cell interactions. Cell-associated 
ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or 

25 may themselves be transmembrane proteins. Extracellular domains also associate with the 
extracellular matrix and contribute to the maintenance of the cell structure. 

Bladder cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 

30 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
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situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
5 Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the bladder cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins may have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

10 proteins are involved in numerous physiological events; e.g., if circulating, they often serve 
to transmit signals to various other cell types. The secreted protein may function in an 
autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting on 
cells in close proximity to the cell that secreted the factor), an endocrine manner (acting on 
cells at a distance, e.g, secretion into the blood stream), or exocrine (secretion, e.g., through a 

15 duct or to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, 
lacrimal glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted 
molecules often find use in modulating or altering numerous aspects of physiology. Bladder 
cancer proteins that are secreted or released proteins are particularly preferred in the present 
invention as they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, 

20 or urine tests. Those which are enzymes may be antibody or small molecule targets. Others 
may be useful as vaccine targets, e.g., via CTL mechanisms. 

Use of bladder cancer nucleic acids 

As described above, bladder cancer sequence is initially identified by substantial 
25 nucleic acid and/or amino acid sequence homology or linkage to the bladder cancer 

sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. 
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The bladder cancer nucleic acid sequences of the invention, e.g., the sequences in 
Tables 1 A-13, can be fragments of larger genes, e.g., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
5 sequences provided herein, extended sequences, in either direction, of the bladder cancer 
genes can be obtained, using techniques well known in the art for cloning either longer 
sequences or the full length sequences; see Ausubel, et al., supra. Much can be done by 
informatics and many sequences can be clustered to include multiple sequences 
corresponding to a single gene, e.g., systems such as UniGene (see, 

1 0 http://www.ncbi.nlm.nih.gov/UniGene/). 

Once a bladder cancer nucleic acid is identified, it can be cloned and, if necessary, its 
constituent parts recombined to form the entire bladder cancer nucleic acid coding regions or 
the entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 

15 recombinant bladder cancer nucleic acid can be further-used as a probe to identify and isolate 
other bladder cancer nucleic acids, e.g., extended coding regions. It can also be used as a 
"precursor" nucleic acid to make modified or variant bladder cancer nucleic acids and 
proteins. 

The bladder cancer nucleic acids of the present invention are used in several ways. In 
20 a first embodiment, nucleic acid probes to the bladder cancer nucleic acids are made and 

attached to biochips to be used in screening and diagnostic methods, as outlined below, or for 
administration, e.g., for gene therapy, vaccine, and/or antisense/inhibition applications. 
Alternatively, the bladder cancer nucleic acids that include coding regions of bladder cancer 
proteins can be put into expression vectors for the expression of bladder cancer proteins, 
25 again for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to bladder cancer nucleic acids (both 
the nucleic acid sequences outlined in the tables and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the bladder cancer nucleic acids, e.g., the target sequence (either the target 
30 sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
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hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be a number of base pair 
mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
5 is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

1 0 A nucleic acid probe is generally single stranded but can be partially single and 

partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 

1 5 genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 

20 particular target. The probes can be overlapping (e.g., have some sequence in common), or 
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 

25 solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 

30 the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 

39 



WO 03/003906 



PCT/US02/21338 



equivalents herein is meant that the two moieties, the solid support and the probe, are 
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
5 support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
10 the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified to contain 
discrete individual sites appropriate for the attachment or association of the nucleic acid 
probes and is amenable to at least one detection method. As will be appreciated by those in 

15 the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 

20 plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. See WO 00/55627. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 

25 volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group including, but not limited to, 

30 amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
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particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 
homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
5 catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 

additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, and then 
attached to the surface of the solid support. As will be appreciated by those skilled in the art, 
10 either the 5' or 3' terminus may be attached to the solid support, or attachment may be via an 
internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very strong, 
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 
surfaces covalently coated with streptavidin, resulting in attachment. 

1 5 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 
situ, using well known photolithographic techniques, such as those described in WO 
95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited 

20 within, all of which are expressly incorporated by reference; these methods of attachment 
form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 
bladder cancer-associated sequences. These assays are typically performed in conjunction 
with reverse transcription. In such assays, a bladder cancer-associated nucleic acid sequence 

25 acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). Li 
a quantitative amplification, the amount of amplification product will be proportional to the 
amount of template in the original sample. Comparison to appropriate controls provides a 
measure of the amount of bladder cancer-associated RNA. Methods of quantitative 
amplification are well known to those of skill in the art. Detailed protocols for quantitative 
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PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and 
Applications Academic Press. 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5 1 fluorescent 
5 dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3 1 end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3 ! 
quenching agent, thereby resulting in an increase in fluorescence as a function of 

10 amplification. See, e.g., literature provided by Perkin-Ehner, e.g., www2.perkin-elmer.com. 
Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560-569; Landegren, et al. (1988) 
Science 241:1077-1080; and Barringer, et al. (1990) Gene 89:117-122), transcription 
amplification (Kwoh, et al. (1989) Proc. Natl Acad. Sci. USA 86:1173-1177), self-sustained 

15 sequence replication (Guatelli, et al. (1990) Proc. Nat'l Acad. Sci. USA 87:1874-1878), dot 
PCR, and linker adapter PCR, etc. 

Expression of bladder cancer proteins from nucleic acids 

In a preferred embodiment, bladder cancer nucleic acids, e.g., encoding bladder 
20 cancer proteins, are used to make a variety of expression vectors to express bladder cancer 
proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art (see, e.g., 
Ausubel, supra, and Fernandez and Hoeffler (eds. 1999) Gene Expression Systems Academic 
Press) and are used to express proteins. The expression vectors may be either self-replicating 
25 extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the bladder c v ancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
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promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
5 operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to facilitate translation; two sequences 
may be operably linked if they are physically linked on a single polynucleotide. Generally, 

10 "operably linked' 1 means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have 
to be contiguous. Linking is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in 
accordance with conventional practice. Transcriptional and translational regulatory nucleic 

15 acid will generally be appropriate to the host cell used to express the bladder cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 

20 sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters 
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
25 combine elements of more than one promoter, are useful in the present invention. 

In addition, an expression vector may comprise additional elements. The expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, 
e.g., in mammalian or insect cells for expression and in a procaryotic host for cloning and 
replication. For integrating expression vectors, the expression vector may contain at least one 
30 sequence homologous to the host cell genome, and preferably two homologous sequences 
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which flank the expression construct. The integrating vector may be directed to a specific 
locus in the host cell by selecting the appropriate homologous sequence for inclusion in the 
vector. Constructs for integrating vectors are well known (e.g., Fernandez and Hoeffler, 
supra). 

5 In addition, in a preferred embodiment, the expression vector contains a selectable 

marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The bladder cancer proteins of the present invention may be produced by culturing a 
host cell transformed with an expression vector under the appropriate conditions to induce or 

1 0 cause expression of the bladder cancer protein. Conditions appropriate for bladder cancer 
protein expression will vary with the choice of the expression vector and the host cell, and 
will be easily ascertained by one skilled in the art through routine experimentation or 
optimization. For example, the use of constitutive promoters in the expression vector will 
typically require optimizing the growth and proliferation of the host cell, while the use of an 

1 5 inducible promoter typically requires identifying the appropriate growth conditions for 
induction. In addition, in some embodiments, the timing of the harvest is important. For 
example, the baculoviral systems used in insect cell expression are lytic viruses, and harvest 
time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, insect, and 

20 animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the bladder cancer proteins are expressed in mammalian 

25 cells. Mammalian expression systems include retroviral and adenoviral systems. Retroviral 
vector systems are described in PCT/US97/01019 and PCT/US97/01048. Of particular use 
are promoters from mammalian viral genes, since viral genes are often highly expressed and 
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 

30 CMV promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription 
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termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter elements, 
flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived from SV40. 
5 Methods of introducing exogenous nucleic acid into mammalian and other hosts are 

well known, and will vary with the host cell used. Techniques include dextran-mediated 
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast 
fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, 
and direct microinjection of the DNA into nuclei. 

10 In another embodiment, bladder cancer proteins are expressed in bacterial systems. 

Promoters from bacteriophage may also be used. Synthetic promoters and hybrid promoters 
are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter sequences. A 
bacterial promoter can include naturally occurring promoters of non-bacterial origin that have 
the ability to bind bacterial RNA polymerase and initiate transcription. Often an efficient 

15 ribosome binding site is desirable. The expression vector may include a signal peptide 
sequence that provides for secretion of the bladder cancer protein. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may include a selectable marker gene to allow for the selection of bacterial 

20 strains that have been transformed. Suitable selection genes include genes which render the 
bacteria resistant to drugs, e.g., ampicillin, chloramphenicol, erythromycin, kanamycin, 
neomycin, and tetracycline, or biosynthetic genes, e.g., those in the histidine, tryptophan, and 
leucine biosynthetic pathways. These components are assembled into expression vectors. 
Expression vectors for bacteria include vectors for Bacillus subtilis, E. coli, Streptococcus 

25 cremoris, and Streptococcus lividans, among others (e.g., Fernandez and Hoeffler, supra). 
The bacterial expression vectors are transformed into bacterial host cells using, e.g., calcium 
chloride treatment, electroporation, and other methods. 

Bladder cancer proteins can also be produced in insect cells. See, e.g., Miller, et al. 
( 1 997) Baculovirus Expression Vectors: A Laboratory Manual Oxford Books; ISBN: 

30 0716770172; and Makrides (1999) Prot. Expr. Purif. 17:183-202. 
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Bladder cancer protein may be produced in yeast cells. Yeast expression systems 
exist with expression vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa, 
Hansenulapolymorpha, Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. 
pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. See, e.g., Jones, et al. (eds. 
5 1 993) The Molecular and Cellular Biology of the Yeast Saccharomyces: Gene Expression 
CSH Press; ISBN: 0879693657. 

The bladder cancer protein may also be made as a fusion protein, using techniques 
well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired 
epitope is small, the bladder cancer protein may be fused to a carrier protein to form an 

1 0 immunogen. Alternatively, the bladder cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the bladder cancer protein is a 
bladder cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression or purification purposes. 

The bladder cancer protein is typically purified or isolated after expression. Bladder 

1 5 cancer proteins may be isolated or purified in a variety of ways, depending on what other 

components are present in the sample. Standard purification methods include electrophoretic, 
molecular, immunological, and chromatographic techniques, including ion exchange, 
hydrophobic, affinity, reverse-phase HPLC chromatography, and chromatofocusing. The 
bladder cancer protein may be purified using a standard anti-bladder cancer protein antibody 

20 affinity column. Ultrafiltration and diafiltration techniques, in conjunction with protein 

concentration, are also useful. For general guidance in suitable purification techniques, see, 
e.g., Scopes (1982) Protein Purification Springer- Verlag. The degree of purification 
necessary will vary depending on the use of the bladder cancer protein. In some instances no 
purification will be necessary, which may depend on the intended use. 

25 Once expressed and purified, if necessary, the bladder cancer proteins and nucleic 

acids are useful in a number of applications. They may be used as immunoselection reagents, 
as vaccine reagents, as screening agents, etc. 

Variants of bladder cancer proteins 
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In one embodiment, the bladder cancer proteins are derivative or variant bladder 
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative bladder cancer peptide will often contain at least one amino acid substitution, 
deletion, or insertion, with amino acid substitutions being particularly preferred. The amino 
5 acid substitution, insertion, or deletion may occur at most residues within the bladder cancer 
peptide. 

Certain embodiments of bladder cancer proteins of the present invention are amino 
acid sequence variants. These variants typically fall into one or more of three classes: 
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by 

10 site specific mutagenesis of nucleotides in the DNA encoding the bladder cancer protein, 
using cassette or PCR, mutagenesis, or other techniques well known in the art, to produce 
DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture as 
outlined above. However, variant bladder cancer protein fragments having up to about 100- 
150 residues may be prepared by in vitro synthesis using established techniques. Amino acid 

15 sequence variants are often characterized by the predetermined nature of the variation, a 
feature that sets them apart from naturally occurring allelic or interspecies variation of the 
bladder cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

20 While the site or region for introducing an amino acid sequence variation is often 

predetermined, the mutation per se need not be predetermined. To optimize the performance 
of a mutation at a given site, random mutagenesis may be conducted at the target codon or 
region and the expressed bladder cancer variants screened for the optimal combination of 
desired activities. Techniques for making substitution mutations at predetermined sites in 

25 DNA having a known sequence are well known, e.g., Ml 3 primer mutagenesis and PCR 
mutagenesis. Screening of the mutants is performed using assays of bladder cancer protein 
activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on 
the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
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tolerated. Deletions range from about 1-20 residues, although in some cases deletions may be 
much larger. 

Substitutions, deletions, insertions, or combinations thereof may be used to arrive at a 
final derivative. Generally these changes are done on a few amino acids to minimize the 
5 alteration of the molecule. However, larger changes may be tolerated in certain 

circumstances. When small alterations in the characteristics of the bladder cancer protein are 
desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and elicit the 
10 same immune response as the naturally-occurring analog, although variants also are selected 
to modify the characteristics of the bladder cancer proteins as needed. Alternatively, the 
variant may be designed such that the biological activity of the bladder cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by selecting 
15 substitutions that are less conservative than those described above. Substitutions may be 

made which more significantly affect: the structure of the polypeptide backbone in the area of 
the alteration, e.g., the alpha-helical or beta-sheet structure; the charge or hydrophobicity of 
the molecule at the target site; or the bulk of the side chain. Substitutions which are expected 
to produce the greatest changes in the polypeptide's properties are those in which (a) a 
20 hydrophilic residue, e.g., serine or threonine is substituted for (or by) a hydrophobic residue, 
e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a cysteine or proline is 
substituted for (or by) another residue; (c) a residue having an electropositive side chain, e.g., 
lysine, arginine, or histidine, is substituted for (or by) an electronegative residue, e.g., 
glutamic acid or aspartitc acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, 
25 is substituted for (or by) one not having a side chain, e.g., glycine, 

Covalent modifications of bladder cancer polypeptides are included within the scope 
of this invention. One type of covalent modification includes reacting targeted amino acid 
residues of a bladder cancer polypeptide with an organic derivatizing agent that is capable of 
reacting with selected side chains or the N-or C-terminal residues of a bladder cancer 
30 polypeptide. Derivatization with Afunctional agents is useful, e.g., for crosslinking bladder 
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cancer polypeptides to a water-insoluble support matrix or surface for use in the method for 
purifying anti-bladder cancer polypeptide antibodies or screening assays. Commonly used 
crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional 
5 imidoesters, including disuccinimidyl esters such as 3,3 ! -di1hiobis(succiiiimidylpropionate), 
Afunctional maleimides such as bis-N-maleimido-l,8-octane and agents such as methyl-3- 
((p-azidophenyl)dithio)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to 
the corresponding glutamic and aspartic residues, respectively, hydroxylation of proline and 

1 0 lysine, phosphorylation of hydroxyl groups of serine, threonine or tyrosine residues, 

methylation of the amino groups of the lysine, arginine, and histidine side chains (pp. 79-86, 
Creighton (1984) Proteins: Structure and Molecular Properties Freeman), acetylation of the 
N-terminal amine, and amidation of a C-terminal carboxyl group. 

Another type of covalent modification of the bladder cancer polypeptide included 

15 within the scope of this invention comprises altering the native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to 
mean deleting one or more carbohydrate moieties found in native sequence bladder cancer 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence bladder cancer polypeptide. Glycosylation patterns can be altered in many ways. 

20 For example the use of different cell types to express bladder cancer-associated sequences 
can result in different glycosylation patterns. 

Addition of glycosylation sites to bladder cancer polypeptides may also be 
accomplished by-altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 

25 sequence bladder cancer polypeptide (for O-linked glycosylation sites). The bladder cancer 
amino acid sequence may optionally be altered through changes at the DNA level, 
particularly by mutating the DNA encoding the bladder cancer polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the bladder 

30 cancer polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. 

49 



WO 03/003906 



PCT/US02/21338 



Such methods are described in the art, e.g., in WO 87/05330, and pp. 259-306 in Aplin and 
Wriston (1981) CRC Crit. Rev. Biochem. 

Removal of carbohydrate moieties present on the bladder cancer polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding 
5 amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are known in the art. See, e.g., Hakimuddin, et al. (1987) Arch. Biochem. 
Biophvs. 259:52-57; and Edge, et al. (1981) Anal. Biochem. 118:131-137, Enzymatic 
cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of 
endo-and exo-glycosidases. See, e.g., Thotakura, et al. (1987) Meth. Enzvmol. 138:350-359. 

1 0 Another type of covalent modification of bladder cancer comprises linking the 

bladder cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192; or 4,179,337. 

Bladder cancer polypeptides of the present invention may also be modified to form 

1 5 chimeric molecules comprising a bladder cancer polypeptide fused to a heterologous 

polypeptide or amino acid sequence. In one embodiment, a chimeric molecule comprises a 
fusion of a bladder cancer polypeptide with an epitope tag. The epitope tag is generally 
placed at the amino-or carboxyl-terminus of the bladder cancer polypeptide. The presence of 
such epitope-tagged forms of a bladder cancer polypeptide can be detected using an antibody 

20 against the tag polypeptide. Also, provision of the epitope tag enables the bladder cancer 
polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a bladder cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 

25 chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. 
(1988) Mol. Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 

30 9E10 antibodies thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. 
(1990) Protein Engineering 3:547-553). Other tag polypeptides include the Flag-peptide 
(Hopp, et al. (1988) BioTechnologv 6:1204-1210); the KT3 epitope peptide (Martin, et al. 
(1992) Science 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 
5 266:15163-15166); and the T7 gene 10 protein peptide tag (Lutz-Freyennuth, et al. (1990) 
Proc. Nat'l Acad. Sci. USA 87:6393-6397). 

Also included are other bladder cancer proteins of the bladder cancer family, and 
bladder cancer proteins from other organisms, which are cloned and expressed as outlined 
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 

1 0 used to find other related bladder cancer proteins from humans or other organisms. As will 
be appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the bladder cancer nucleic acid sequence. Preferred PCR primers 
are from about 15-35 nucleotides in length, with from about 20-30 being preferred, and may 
contain inosine as needed. The conditions for the PCR reaction are well known. See, e.g., 

1 5 Innis (1990) PCR Protocols, supra. 

Antibodies to bladder cancer proteins 

In a preferred embodiment, when the bladder cancer protein is to be used to generate 
antibodies, e.g., for immunotherapy or immunodiagnosis, the bladder cancer protein should 

20 share at least one epitope Or determinant with the full length protein. By "epitope" or 

"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller bladder cancer protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 

25 antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known (see, e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 

30 or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
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nucleic acid of the tables or fragment thereof or a fusion protein thereof. It may be useful to 
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being 
immunized. Examples of such immunogenic proteins include but are not limited to keyhole 
limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
5 Examples of adjuvants which may be employed include Freund's complete adjuvant and 
MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The 
immunization protocol may be selected as appropriate. 

The antibodies may be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein (1975) 

1 0 Nature 256:495-497. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce 
or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of Tables 1A-13 or fragment 

1 5 thereof, or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") 
are used if cells of human origin are desired, or spleen cells or lymph node cells are used if 
non-human mammalian sources are desired. The lymphocytes are then fused with an 
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell (pp. 59-103, Goding (1986) Monoclonal Antibodies: Principles and Practice 

20 Academic Press). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfiised, immortalized cells. For example, if the parental cells lack the enzyme 

25 hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 

30 specificities for at least two different antigens or that have binding specificities for two 
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epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1A-13 or a fragment thereof, the other one is for 
another antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
5 multivalent reagents. 

In a preferred embodiment, the antibodies to bladder cancer protein are capable of 
reducing or eliminating a biological function of a bladder cancer protein, as is described 
below. That is, the addition of anti-bladder cancer protein antibodies (either polyclonal or 
preferably monoclonal) to bladder cancer tissue (or cells containing bladder cancer) may 

10 reduce or eliminate the bladder cancer. Generally, at least about 25% decrease in activity, 
growth, size, or the like is preferred, with at least about 50% being particularly preferred, and 
about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the bladder cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences; Medarex, Inc.; Abgenix, Inc.; Protein 

15 Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementary 

20 determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat, or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 

25 imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 

30 at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
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immunoglobulin. See Jones, et al. (1986) Nature 321:522-525; Riechmann, et al. (1988) 
Nature 332:323-329; and Presta (1992) Curr. Op. Struct, Biol. 2:593-596. Humanization can 
be performed, e.g., following the method of Winter and co-workers (see Jones, et al. (1986) 
Nature 321:522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) 
5 Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for the 

corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

1 0 Human antibodies can also be produced using various techniques known in the art, 

including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381- 
388; Marks, et al. (1991) J. Mol. Biol. 222:581-597) or of human monoclonal antibodies 
(e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer 
Therapy Liss; and Boerner, et al. (1991) J. Immunol. 147:86-95). Similarly, human 

1 5 antibodies can be made by introducing of human immunoglobulin loci into transgenic 

animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or 
completely inactivated. Upon challenge, human antibody production is observed, which 
closely resembles that seen in humans in all respects, including gene rearrangement, 
assembly, and antibody repertoire. This approach is described, e.g., in U.S. Patent Nos. 

20 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Neuberger (1996) Nature Biotechnology 14:826 commenting on 
Fishwild, et al. (1996) Nature Biotechnology 14:845-51; and Lonberg and Huszar (1995) 
Intern. Rev. Immunol. 13:65-93. 

25 By immunotherapy is meant treatment of bladder cancer with an antibody raised 

against bladder cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
(patient), which may be used to target a label or toxin. Active immunization is the induction 
of antibody and/or T-cell responses in a recipient (patient). Induction of an immune response 

30 is the result of providing the recipient with an antigen to which antibodies are raised. As 
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appreciated by one of ordinary skill in the art, the antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 
expression of the antigen, leading to an immune response. 
5 In a preferred embodiment the bladder cancer proteins against which antibodies are 

raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, bind and prevent the secreted protein from binding to its receptor, thereby 
inactivating the secreted bladder cancer protein. 

In another preferred embodiment, the bladder cancer protein to which antibodies are 

1 0 raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the bladder cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
antibody may cause down-regulation of the transmembrane bladder cancer protein. As will 
be appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 

1 5 competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
bladder cancer protein. The antibody is also an antagonist of the bladder cancer protein. 
Further, the antibody prevents activation of the transmembrane bladder cancer protein. In 
one aspect, when the antibody prevents the binding of other molecules to the bladder cancer 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 

20 sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, IL-1, DsfF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, bladder cancer is 

25 treated by administering to a patient antibodies directed against the transmembrane bladder 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be a number of molecules, including labelling moieties such as 

30 radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the 
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therapeutic moiety is a small molecule that modulates the activity of the bladder cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 
associated with or in close proximity to the bladder cancer protein. The therapeutic moiety 
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity 
5 associated with bladder cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to bladder cancer tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with bladder 
cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin, and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against bladder cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

15 the antibody. Targeting the therapeutic moiety to transmembrane bladder cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the bladder cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the bladder cancer protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
the individual or cell. Moreover, wherein the bladder cancer protein can be targeted within a 
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a 

25 nuclear localization signal. 

The bladder cancer antibodies of the invention specifically bind to bladder cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
Kd of at least about 0.1 mM, more usually at least about 1 |iM, preferably at least about 0.1 
|iM or better, and most preferably, 0.01 \jM or better. Selectivity of binding is also 

30 important. 
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Detection of bladder cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the bladder cancer phenotype. Expression levels of genes in normal tissue 
5 (e.g., not experiencing bladder cancer) and in bladder cancer tissue (and in some cases, for 
varying severities of bladder cancer that relate to prognosis, as outlined below), or in non- 
malignant disease, are evaluated to provide expression profiles. An expression profile of a 
particular cell state or point of development is essentially a "fingerprint" of the state. While 
two states may have a particular gene similarly expressed, the evaluation of a number of 

10 genes simultaneously allows the generation of a gene expression profile that is reflective of 
the state of the cell. By comparing expression profiles of cells in different states, information 
regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 

1 5 tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 

20 normal versus bladder cancer tissue. Genes may be turned on or turned off in a particular 
state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell 
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in 

25 that expression is increased or decreased; e.g., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays. See Lockhart (1996) Nature Biotechnology 14:1675-1680. 

30 Other techniques include, but are not limited to, quantitative reverse transcriptase PCR, 
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northern analysis and RNase protection. As outlined above, preferably the change in 
expression (e.g., upregulation or downregulation) is at least about 50%, more preferably at 
least about 100%, more preferably at least about 150%, more preferably at least about 200%, 
with from about 300-1000% being especially preferred. 
5 Evaluation may be at the gene transcript, or the protein level. The amount of gene 

expression may be monitored using nucleic acid probes to the DNA or RNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 
gene product itself (protein) can be monitored, e.g., with antibodies to the bladder cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 

10 spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to bladder 
cancer genes, e.g., those identified as being important in a bladder cancer or disease 
phenotype, can be evaluated in a bladder cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed simultaneously 
on a number of genes. Multiple protein expression monitoring can be performed as well. 

15 Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of bladder cancer sequences in a 
particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

20 In a preferred embodiment nucleic acids encoding the bladder cancer protein are 

detected. Although DNA or RNA encoding the bladder cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a bladder cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 

25 oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 

30 permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
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probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example, a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
bladder cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
5 secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chIoro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The bladder cancer proteins, antibodies, nucleic acids, modified proteins and cells 
10 containing bladder cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

15 As described and defined herein, bladder cancer proteins, including intracellular, 

transmembrane or secreted proteins, find use as diagnostic or prognostic markers of bladder 
cancer, or to assist in selecting therpay based on expression profile and archival data. 
Detection of these proteins in putative bladder cancer tissue allows for detection or diagnosis 
of bladder cancer. In one embodiment, antibodies are used to detect bladder cancer proteins. 

20 A preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the bladder cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the bladder cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

25 In another preferred method, antibodies to the bladder cancer protein find use in in 

situ imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) "Antibodies in Cell Biology" 
Methods in Cell Biology (vol. 37). In this method cells are contacted with from one to many 
antibodies to the bladder cancer protein(s). Following washing to remove non-specific 
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 

30 the antibody is detected by incubating with a secondary antibody that contains a detectable 
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label In another method the primary antibody to the bladder cancer protein(s) contains a 
detectable label, e.g. an enzyme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and detectable label. 
This method finds particular use in simultaneous screening for a plurality of bladder cancer 
5 proteins. As will be appreciated by one of ordinary skill in the art, many other histological 
imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

10 In another preferred embodiment, antibodies find use in diagnosing bladder cancer 

from blood, serum, plasma, stool, urine, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of bladder cancer proteins. 
Antibodies can be used to detect a bladder cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 

15 immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 

antibodies may indicate an immune response against an endogenous bladder cancer protein. 

In a preferred embodiment, in situ hybridization of labeled bladder cancer nucleic 
acid probes to tissue arrays is done. For example, arrays of tissue samples, including bladder 
cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) 

20 is then performed. When comparing the fingerprints between an individual and a standard, 
the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It 
is further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 

25 In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing bladder cancer sequences are used in prognosis assays. 
As above, gene expression profiles can be generated that correlate to bladder cancer, clinical, 
pathological, or other information, e.g., in terms of long term prognosis. Again, this may be 
done on either a protein or gene level, with the use of genes being preferred. Single or 

30 multiple genes may be useful in various combinations. As above, bladder cancer probes may 
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be attached to biochips for the detection and quantification of bladder cancer sequences in a 
tissue or patient. The assays proceed as outlined above for diagnosis. PCR methods may 
provide more sensitive and accurate quantification. 

5 Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and antibodies as 
described herein are used in drug screening assays. The bladder cancer proteins, antibodies, 
nucleic acids, modified proteins and cells containing bladder cancer sequences are used in 
drug screening assays or by evaluating the effect of drug candidates on a "gene expression 

10 profile" or expression profile of polypeptides. In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to 
allow monitoring for expression profile genes after treatment with a candidate agent. See, 
e.g., Zlokarnik, et al. (1998) Science 279:84-88; and Heid (1996) Genome Res. 6:986-94. 
In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

1 5 modified proteins and cells containing the native or modified bladder cancer proteins are used 
in screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the bladder cancer phenotype or an identified physiological 
function of a bladder cancer protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 

20 embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays may be 
executed. In a preferred embodiment, assays may be run on an individual gene or protein 

25 level. That is, having identified a particular gene as up regulated in bladder cancer, test 

compounds can be screened for the ability to modulate gene expression or for binding to the 
bladder cancer protein. "Modulation" thus includes both an increase and a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing bladder cancer, with changes of at least 

30 about 10%, preferably about 50%, more preferably about 100-300%, and in some 
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embodiments about 300-1000% or greater. Thus, if a gene exhibits about 4-fold increase in 
bladder cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, about 10-fold decrease in bladder cancer tissue compared to normal tissue 
often provides a target value of about 10-fold increase in expression to be induced by the test 
5 compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
monitored, e.g., through the use of antibodies to the bladder cancer protein and standard 
immunoassays. Proteomics and separation techniques may also allow quantification of 
10 expression. 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically 
involve a plurality of those entities described herein.. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
15 outlined herein for the detection and quantification of bladder cancer sequences in a 

particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PCR reaction can then be performed and 
analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify the 
20 expression of one or more bladder cancer-associated sequences, e.g., a polynucleotide 

sequence set out inTables 1 A-13. Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate bladder cancer, modulate bladder cancer proteins, bind to a bladder cancer 
protein, or interfere with the binding of a bladder cancer protein and an antibody or other 
25 binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the bladder cancer phenotype or the expression of a bladder cancer sequence, 
30 e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
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expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a bladder cancer phenotype, e.g., to a normal tissue or 
non-malignant fingerprint. In another embodiment, a modulator induced a bladder cancer 
phenotype. Generally, a plurality of assay mixtures are run in parallel with different agent 
5 concentrations to obtain a differential response to the various concentrations. Typically, one 
of these concentrations serves as a negative control, e.g., at zero concentration or below the 
level of detection. 

Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 

10 than about 100 and less than about 2,500 daltons. Preferred small molecules are less than 
about 2000, or less than about 1500 or less than about 1000 or less than about 500 D. 
Candidate agents comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, 
hydroxyl, or carboxyl group, preferably at least two of the functional chemical groups. The 

1 5 candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. 
Particularly preferred are peptides. 

20 In one aspect, a modulator will neutralize the effect of a bladder cancer protein. By 

"neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a bladder cancer polypeptide or to modulate activity. 
25 Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 
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In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
5 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 

10 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (e.g., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks. See, e.g., 
Gallop, et aL (1994) J. Med. Chem. 37:1233-1251. 

1 5 Preparation and screening of combinatorial chemical libraries is well known to those 

of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pent. Prot. Res. 37:487- 
493, Houghton, et al. (1991) Nature 354:84-88); peptoids (PCT Publication No WO 
91/19735); encoded peptides (PCT Publication WO 93/20242); random bio-oligomers (PCT 

20 Publication WO 92/00091); benzodiazepines (U.S. Pat. No. 5,288,514); diversomers such as 
hydantoins, benzodiazepines, and dipeptides (Hobbs, et al. (1993) Proc. Nat'l Acad. Sci. USA 
90:6909-6913); vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
1 14:6568-6570); nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann, et al. (1992) J. Amer. Chem. Soc. 114:9217-9218); analogous organic 

25 syntheses of small compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661- 
2662); oligocarbamates (Cho, et al. (1993^ Science 261:1303-1305); and/or peptidyl 
phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658-xxx). See, generally, Gordon, et 
al. (1994) J. Med. Chem. 37:1385-1401, nucleic acid libraries (see, e.g., Strategene, Corp.); 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083); antibody libraries (see, e.g., 

30 Vaughn, et al. (1996) Nature Biotechnology 14:309-314, and PCT/US96/10287); 
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carbohydrate libraries (Liang, et al. (1996) Science 274:1520-1522, and US. Patent No. 
5,593,853); and small organic molecule libraries (see, e.g., benzodiazepines, Baum (p. 33, 
Jan 18, 1993) C&ENews); isoprenoid (U.S. Patent No. 5,569,588); thiazolidinones and 
metathiazanones (U.S. Patent No. 5,549,974); pyrrolidines (U.S. Patent Nos. 5,525,735 and 
5 5,519,134); morpholino compounds (U.S. Patent No. 5,506,337); benzodiazepines (U.S. 
Patent No. 5,288,514); and the like. 

Devices for the preparation of combinatorial libraries are commercially available. 
See, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin, 
Woburn, MA; 433A Applied Biosystems, Foster City, CA; and 9050 Plus, Miilipore, 
10 Bedford, MA. 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, MA; Orca, 

15 Hewlett-Packard, Palo Alto, CA), which mimic the manual synthetic operations performed by 
a chemist. The above devices are suitable for use with the present invention. The nature and 
implementation of modifications to these devices (if any) so that they can operate as 
discussed herein will be apparent to persons skilled in the relevant art. In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g., ComGenex, 

20 Princeton, NJ; Asinex, Moscow, Ru; Tripos, Inc., St. Louis, MO; ChemStar, Ltd, Moscow, 
RU; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of bladder cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 

25 polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other properties 
of particular nucleic acids or protein products are well known to those of skill in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, e.g., U.S. 
Patent No. 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent 

30 No. 5,585,639 discloses high throughput screening methods for nucleic acid binding (e.g., in 
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arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of 
screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
5 Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA; etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 

10 detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 

1 5 random or directed digests of proteinaceous cellular extracts, may be used. In this way 

libraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 
usefiil test compound will be directed to the class of proteins to which the target belongs, e.g., 

20 substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids, 
with from about 5-20 amino acids being preferred, and from about. 7-1 5 being particularly 
preferred. The peptides may be digests of naturally occurring proteins as is outlined above, 
random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents 

25 herein is meant that each nucleic acid and peptide consists of essentially random nucleotides 
and amino acids, respectively. Since generally these random peptides (or nucleic acids, 
discussed below) are chemically synthesized, they may incorporate nucleotide or amino acid 
substitutions. The synthetic process can be designed to generate randomized proteins or 
nucleic acids, to allow the formation of all or most of the possible combinations over the 
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length of the sequence, thus forming a library of randomized candidate bioactive 
proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants. In a preferred embodiment, the library is biased. That is, some positions within 
5 the sequence are either held constant, or are selected from a limited number of possibilities. 
For example, in a preferred embodiment, the nucleotides or amino acid residues are 
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 
1 0 threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc. 

Modulators of bladder cancer can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 
15 In a preferred embodiment, the candidate compounds are organic chemical moieties, a 

wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
20 the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
25 chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
30 altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
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or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 
is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization 
5 assays or can comprise "sandwich assays", which include the use of multiple probes, as is 
generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,1 17, 
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 
5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by reference. In 
this embodiment, in general, the target nucleic acid is prepared as outlined above, and then 

1 0 added to the biochip comprising a plurality of nucleic acid probes, under conditions that 
allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate and low stringency conditions as outlined above. The assays are generally 
run under stringency conditions which allows formation of the label probe hybridization 

1 5 complex only in the presence of target. Stringency can be controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 

20 outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 

25 other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc., 
which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
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The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the bladder cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
5 particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 
important in a particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
10 expressed gene. Again, having identified the importance of a gene in a particular state, 

screens are performed to identify agents that bind and/or modulate the biological activity of 
the gene product. 

In addition screens can be done for genes that are induced in response to a candidate 
agent. After identifying a modulator based upon its ability to suppress a bladder cancer 

15 expression pattern leading to a normal expression pattern, or to modulate a single bladder 
cancer gene expression profile so as to mimic the expression of the gene from normal tissue, 
a screen as described above can be performed to identify genes that are specifically 
modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated bladder cancer tissue reveals genes that are not expressed in normal tissue 

20 or bladder cancer tissue, but are expressed in agent treated tissue. These agent-specific 

sequences can be identified and used by methods described herein for bladder cancer genes or 
proteins. In particular these sequences and the proteins they encode find use in marking or 
identifying agent treated cells. In addition, antibodies can be raised against the agent induced 
proteins and used to target novel therapeutics to the treated bladder cancer tissue sample. 

25 Thus, in one embodiment, a test compound is administered to a population of bladder 

cancer cells, that have an associated bladder cancer expression profile. By "administration" 
or "contacting" herein is meant that the candidate agent is added to the cells in such a manner 
as to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

30 candidate agent (e.g., a peptide) may be put into a viral construct such as an adenoviral or 
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retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used. 

Once the test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 
5 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., bladder cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress the bladder cancer phenotype. A change in at least one 
gene, preferably many, of the expression profile indicates that the agent has an effect on 

1 0 bladder cancer activity. By defining such a signature for the bladder cancer phenotype, 

screens for new drugs that alter the phenotype can be devised. With this approach, the drug 
target need not be known and need not be represented in the original expression screening 
platform, nor does the level of transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on individual 

15 genes and gene products (proteins). That is, having identified a particular differentially 
expressed gene as important in a particular state, screening of modulators of either the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "bladder cancer proteins" 
or a "bladder cancer modulatory protein". The bladder cancer modulatory protein may be a 

20 fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
acids of the Tables 1 A-13. Preferably, the bladder cancer modulatory protein is a fragment. 
In a preferred embodiment, the bladder cancer amino acid sequence which is used to 
determine sequence identity or similarity is encoded by a nucleic acid of Tables 1 A-13. In 
another embodiment, the sequences are naturally occurring allelic variants of a protein 

25 encoded by a nucleic acid of Tables 1A-13. In another embodiment, the sequences are 
sequence variants as further described herein. 

Preferably, the bladder cancer modulatory protein is a fragment of approximately 14 
to 24 amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-transmembrane region. In a preferred embodiment, the fragment has 
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an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of the fragment is 
kept as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to cysteine. 

In one embodiment the bladder cancer proteins are conjugated to an immunogenic 
agent as discussed herein. In one embodiment the bladder cancer protein is conjugated to 
5 BSA. 

Measurements of bladder cancer polypeptide activity, or of bladder cancer or the 
bladder cancer phenotype can be performed using a variety of assays. For example, the 
effects of the test compounds upon the function of the bladder cancer polypeptides can be 
measured by examining parameters described above. A suitable physiological change that 

1 0 affects activity can be used to assess the influence of a test compound on the polypeptides of 
this invention. When the functional consequences are determined using intact cells or 
animals, one can also measure a variety of effects such as, in the case of bladder cancer 
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone 
release, transcriptional changes to both known and uncharacterized genetic markers (e.g., 

1 5 northern blots), changes in cell metabolism such as cell growth or pH changes, and changes 
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian 
bladder cancer polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a bladder cancer polypeptide is first contacted with a potential modulator and 

20 incubated for a suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the 

bladder cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELIS A and the like with an antibody that selectively binds to the bladder cancer polypeptide 
or a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

25 hybridization assays, e.g., northern hybridization, RNase protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the bladder cancer protein 

30 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
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CAT, or P-gaL The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 
5 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
genes are sometimes referred to herein as "bladder cancer proteins." The bladder cancer 
protein may be a fragment, or alternatively, be the full length protein to a fragment shown 
10 herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
15 differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed 
nucleic acids are made. For example, antibodies are generated to the protein gene products, 
20 and standard immunoassays are run to determine the amount of protein present. 

Alternatively, cells comprising the bladder cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a bladder cancer 
protein and a candidate compound, and determining the binding of the compound to the 
bladder cancer protein. Preferred embodiments utilize the human bladder cancer protein, 
25 although other mammalian proteins may also be used, e.g., for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
bladder cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the bladder cancer 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
30 sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be 
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made of a composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of a convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
5 typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 

10 composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 

1 5 areas may then be blocked through incubation with bovine serum albumin (BS A), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the bladder cancer protein is bound to the support, and a 
test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the bladder cancer protein is added. Novel binding agents include specific 

20 antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 

25 The determination of the binding of the test modulating compound to the bladder 

cancer protein may be done in a number of ways. In a preferred embodiment, the compound 
is labeled, and binding determined directly, e.g., by attaching all or a portion of the bladder 
cancer protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), 
washing off excess reagent, and determining whether the label is present on the solid support. 

30 Various blocking and washing steps may be utilized as appropriate. 
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In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) can be labeled. Alternatively, more than one 
component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
5 useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor is a binding moiety known to bind to the target molecule (e.g., 
a bladder cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circumstances, there may be competitive binding between the compound and the 

10 binding moiety, with the binding moiety displacing the compound. In one embodiment, the 
test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, typically between 4 and 40°C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 

15 between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 
away. The second component is then added, and the presence or absence of the labeled 
component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding 

20 to the bladder cancer protein and thus is capable of binding to, and potentially modulating, 
the activity of the bladder cancer protein. In this embodiment, either component can be 
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 

25 In an alternative embodiment, the test compound is added first, with incubation and 

washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the bladder cancer protein with a higher affinity. Thus, if 
the test compound is labeled, the presence of the label on the support, coupled with a lack of 
competitor binding, may indicate that the test compound is capable of binding to the bladder 

30 cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the bladder cancer proteins. In this 
embodiment, the methods comprise combining a bladder cancer protein and a competitor in a 
first sample. A second sample comprises a test compound, a bladder cancer protein, and a 
5 competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the bladder cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the bladder cancer protein. 

10 Alternatively, differential screening is used to identify drug candidates that bind to the 

native bladder cancer protein, but cannot bind to modified bladder cancer proteins. The 
structure of the bladder cancer protein may be modeled, and used in rational drug design to 
synthesize agents that interact with that site. Drug candidates that affect the activity of a 
bladder cancer protein are also identified by screening drugs for the ability to either enhance 

15 or reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the 

20 amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
25 facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
30 compound capable of modulating the activity of a bladder cancer protein. The methods 
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comprise adding a test compound, as defined above, to a cell comprising bladder cancer 
proteins. Many different cell types may be transfected to contain a recombinant nucleic acid 
that encodes a bladder cancer protein. In a preferred embodiment, a library of candidate 
agents are tested on a plurality of cells. 
5 In one aspect, the assays are evaluated in the presence or absence or previous or 

subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
cytokines, growth factors, action potentials, and pharmacological agents including, e.g., 
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In one 
example, the determinations are determined at different stages of the cell cycle process. 

10 In this way, compounds that modulate bladder cancer agents are identified. 

Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the bladder cancer protein. Once identified, similar structures are evaluated to identify 
critical structural feature of the compound. 

In one embodiment, a method of inhibiting bladder cancer cell division is provided. 

1 5 The method comprises administration of a bladder cancer inhibitor. In another embodiment, 
a method of inhibiting bladder cancer is provided. The method comprises administration of a 
bladder cancer inhibitor. In a further embodiment, methods of treating cells or individuals 
with bladder cancer are provided. The method comprises administration of a bladder cancer 
inhibitor. In one embodiment, a bladder cancer inhibitor is an antibody as discussed above. 

20 In another embodiment, the bladder cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to those of 
skill in the art, as described below. 
Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 

25 transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 

30 modulators of bladder cancer sequences, which when expressed in host cells, inhibit 
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abnormal cellular proliferation and transformation. A therapeutic compound would reduce or 
eliminate the host cells 1 ability to grow in stirred suspension culture or suspended in semi- 
solid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
5 described in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique (3d ed.) 
Wiley-Liss, herein incorporated by reference. See also, the methods section of Garkavtsev, et 
al. (1996), supra, herein incorporated by reference. 
Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 

10 touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 

15 pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 

20 method of measuring density limitation of growth. Transformed host cells are transfected 
with a bladder cancer-associated sequence and are grown for 24 hours at saturation density in 

non-limiting medium conditions. The percentage of cells labeling with ( 3 H)-thymidine is 

determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 

25 Transformed cells have a lower serum dependence than their normal counterparts 

(see, e.g., Temin (1966) J. Natl Cancer Inst. 37:167-175; Eagle, et al. (1970) J. Exp. Med. 

131 :836-879; Freshney (1994), supra). This is in part due to release of various growth factors 

by the transformed cells. Growth factor or serum dependence of transformed host cells can 

be compared with that of control. 

30 Tumor specific markers levels 
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Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells. See, e.g., 
"Angiogenesis, tumor vascularization, and potential interference with tumor growth" pp. 178- 
5 1 84 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. Similarly, tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) Sem Cancer Biol. 3:89-96. 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. See also, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 

10 Strickland and Beers (1 976) J. Biol. Chem. 251:5694-5702; Whur,-et al. (1980^ Br. J. Cancer 
42:305-312; Gullino "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer, Plenum; 
and Freshney (1985) Anticancer Res . 5:111-130. 
Invasiveness into Matrigel 

15 The degree of invasiveness into Matrigel or some other extracellular matrix 

constituent can be used as an assay to identify compounds that modulate bladder cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 

20 gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1984), supra, can be used. Briefly, the level of 
invasion of host cells can be measured by using filters coated with Matrigel or some other 
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 
filter, is rated as invasiveness, and rated histologically by number of cells and distance 

25 moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side 
of the filter or bottom of the dish. See, e.g., Freshney (2000), supra. 
Tumor growth in vivo 

Effects of bladder cancer-associated sequences on cell growth can be tested in 
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 

30 the bladder cancer gene is disrupted or in which a bladder cancer gene is inserted. Knock-out 
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transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous bladder cancer gene site in the mouse genome via homologous 
recombination. Such mice can also be made by substituting the endogenous bladder cancer 
gene with a mutated version of the bladder cancer gene, or by mutating the endogenous 
5 bladder cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. By breeding the chimeric 

1 0 mice it is possible to obtain a new line of mice containing the introduced genetic lesion. See, 
e.g., Capecchi, et al. (1989) Science 244:1288-1292. Chimeric targeted mice can be made. 
See Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory Manual CSH Press; 
and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach IRL Press, Washington, D.C. 

15 Alternatively, various immune-suppressed or immune-deficient host animals can be 

used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J. 
Naf 1 Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. 
Cancer 41 :52-61) can be used as a host. Transplantable tumor cells (typically about 10^ 

20 cells) injected into isogenic hosts will produce invasive tumors in a high proportions of cases, 
while normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a bladder cancer-associated sequences are injected subcutaneously. After a 
suitable length of time, preferably about 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tiimors that have 

25 statistically significant reduction (using, e.g., Student's T test) are said to have inhibited 
growth. 

Polynucleotide modulators of bladder cancer 

Antisense and KNAi Polynucleotides 
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In certain embodiments, the activity of a bladder cancer-associated protein is down- 
regulated, or entirely inhibited, by the use of antisense polynucleotide, e.g., a nucleic acid 
complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a bladder cancer protein mRNA, or a subsequence thereof. 
5 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 

10 sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the bladder cancer protein mRNA. See, 
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, 

15 or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oligonucleotides. 
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 

20 sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for bladder cancer molecules. A preferred antisense molecule is for a 
bladder cancer sequences in Tables 1A-13, or for a ligand or activator thereof. Antisense or 
sense oligonucleotides, according to the present invention, comprise a fragment generally at 

25 least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive 
an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given 
protein is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659-2668; and van der 
Krol, et al. (1988) BioTechniques 6:958-976. 

RNA interference is a mechanism to suppress gene expression in a sequence specific 

30 manner. See, e.g., Brumelkamp, et al. (2002) Sciencexnress (21March2002); Sharp (1999) 
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Genes Dev. 13:139-141; and Cathew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494- 
498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 
5 treatment of or validation of relevance to disease 
Ribozvmes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of bladder cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

10 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes. See, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-317 for a general review of the properties of different ribozymes. 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

15 5,254,678. Methods of preparing them are well known. See, e.g., WO 94/26877; Ojwang, et 
al. (1993) Proc. Nat f l Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene 
Therapy 1:39-45; Leavitt, et al.(1995) Proc. Nat'l Acad. Sci. USA 92:699-703; Leavitt, et al. 
(1994) Human Gene Therapy 5:1151-120; and Yamada, et al. (1994) Virology 205: 121-126. 
Polynucleotide modulators of bladder cancer may be introduced into a cell containing 

20 the target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 

25 corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of bladder 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 

30 used in screening assays as discussed above, in addition to methods of treatment. 

81 



WO 03/003906 



PCT/US02/21338 



Thus, in one embodiment, methods of modulating bladder cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-bladder cancer antibody that reduces or eliminates the biological activity of an 
endogenous bladder cancer protein. Alternatively, the methods comprise administering to a 
5 cell or organism a recombinant nucleic acid encoding a bladder cancer protein. This may be 
accomplished in many ways. In a preferred embodiment, e.g., when the bladder cancer 
sequence is down-regulated in bladder cancer, such state may be reversed by increasing the 
amount of bladder cancer gene product in the cell. This can be accomplished, e.g., by 
overexpressing the endogenous bladder cancer gene or administering a gene encoding the 

1 0 bladder cancer sequence, using known gene-therapy techniques. In a preferred embodiment, 
the gene therapy techniques include the incorporation of the exogenous gene using enhanced 
homologous recombination (EHR), e.g., as described in PCI7US93/03868, hereby 
incorporated by reference in its entirety. Alternatively, e.g., when the bladder cancer 
sequence is up-regulated in bladder cancer, the activity of the endogenous bladder cancer 

1 5 gene is decreased, e.g., by the administration of a bladder cancer antisense nucleic acid. 

In one embodiment, the bladder cancer proteins of the present invention may be used 
to generate polyclonal and monoclonal antibodies to bladder cancer proteins. Similarly, the 
bladder cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify bladder cancer 

20 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a bladder cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The bladder cancer 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
bladder cancer proteins. The antibodies may also be used as blocking polypeptides, as 

25 outlined above, since they will specifically bind to the bladder cancer protein. 

Methods of identifying variant bladder cancer-associated sequences 

Without being bound by theory, expression of various bladder cancer sequences is 
correlated with bladder cancer. Accordingly, disorders based on mutant or variant bladder 
30 cancer genes may be determined. In one embodiment, the invention provides methods for 
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identifying cells containing variant bladder cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous bladder cancer genes in a cell. This may be 
accomplished using many sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the bladder cancer genotype of an individual, e.g., 
5 determining all or part of the sequence of at least one bladder cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced bladder cancer gene to a known bladder cancer 
gene, e.g., a wild-type gene. 

10 The sequence of all or part of the bladder cancer gene can then be compared to the 

sequence of a known bladder cancer gene to determine if differences exist. This can be done 
using many known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the bladder cancer gene of the patient and 
the known bladder cancer gene correlates with a disease state or a propensity for a disease 

1 5 state, as outlined herein. 

In a preferred embodiment, the bladder cancer genes are used as probes to determine 
the number of copies of the bladder cancer gene in the genome. 

In another preferred embodiment, the bladder cancer genes are used as probes to 
determine the chromosomal localization of the bladder cancer genes. Information such as 

20 chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the bladder 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

25 In one embodiment, a therapeutically effective dose of a bladder cancer protein or 

modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques. See, e.g., Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug 

30 Delivery Lippincott; Lieberman (1992) Pharmaceutical Dosage Forms (vols. 1-3) Dekker, 
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ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art. Science 
and Technology of Pharmaceutical Compounding Amer. Pharma. Assn.; and Pickar (1999) 
Dosage Calculations Thomson. As is known in the art, adjustments for bladder cancer 
degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as 
5 the age, body weight, general health, sex, diet, time of administration, drug interaction and 
the severity of the condition may be necessary, and will be ascertainable with routine 
experimentation by those skilled in the art. USSN 09/687,576, further discloses the use of 
compositions and methods of diagnosis and treatment in bladder cancer and is hereby 
expressly incorporated by reference. 

10 A "patient" for the purposes of the present invention includes both humans and other 

animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the bladder cancer proteins and modulators thereof of the 

15 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, e.g., in the treatment of wounds and inflammation, the bladder cancer 
proteins and modulators may be directly applied as a solution or spray. 

20 The pharmaceutical compositions of the present invention comprise a bladder cancer 

protein in a form suitable for administration to a patient. In the preferred embodiment, the 
pharmaceutical compositions are in a water soluble form, such as being present as 
pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 

25 biological effectiveness of the free bases and that are not biologically or otherwise 

undesirable, formed with inorganic acids such as hydrochloric, acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

30 methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
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"Pharmaceutically acceptable base addition salts 11 include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
5 acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 

10 carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 

1 5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that bladder cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

20 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a bladder cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 

25 solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 

30 sodium lactate and the like. The concentration of active agent in these formulations can vary 
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widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Hardman and Limbird 
(eds. 2001) Goodman and Gilman: The Pharmacologial Basis of Therapeutics McGraw-Hill. 
5 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0. 1-10 mg per patient per day. Dosages from about 0. 1-100 mg per patient per day 
may be used, particularly when the drug is administered to a secluded site and not into the 
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

1 0 administrable compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gilman: The Pharmacologial Basis 
of Therapeutics, supra. 

The compositions containing modulators of bladder cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 

15 compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 

20 depending on the dosage and frequency as required and tolerated by the patient. The 

composition should provide a sufficient quantity of the agents of this invention to effectively 
treat the patient. An amount of modulator that is capable of preventing or slowing the 
development of cancer in a mammal is referred to as a "prophylactically effective dose." The 
particular dose required for a prophylactic treatment will depend upon the medical condition 

25 and history of the mammal, the particular cancer being prevented, as well as other factors 
such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer based, at least in part, upon gene expression profiles. Vaccine strategies 

30 may be used, in either a DNA vaccine form, or protein vaccine. 
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It will be appreciated that the present bladder cancer protein-modulating compounds 
can be administered alone or in combination with additional bladder cancer modulating 
compounds or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 
In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
5 comprising nucleic acid sequences set forth in Tables 1A-13, such as antisense 

polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 
invention provides methods, reagents, vectors, and cells useful for expression of bladder 
cancer-associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo 
(cell or organism-based) recombinant expression systems. 

1 0 The particular procedure used to introduce the nucleic acids into a host cell for 

expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors, and other methods for introducing cloned genomic DNA, 

15 cDNA, synthetic DNA or other foreign genetic material into a host cell. See, e.g., Berger and 
Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in Enzvmologv (vol. 
152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current Protocols in 
Molecular Biology Lippincott; and Sambrook, et al. (1989) Molecular Cloning: A Laboratory 
Manual (2d ed., Vol. 1-3) CSH Press. 

20 In a preferred embodiment, bladder cancer proteins and modulators are administered 

as therapeutic agents, and can be formulated as outlined above. Similarly, bladder cancer 
genes (including both the full-length sequence, partial sequences, or regulatory sequences of 
the bladder cancer coding regions) can be administered in a gene therapy application. These 
bladder cancer genes can include antisense applications, either as gene therapy (e.g., for 

25 incorporation into the genome) or as antisense compositions, as will be appreciated by those 
in the art. 

Bladder cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (Vitiello, et al. (1995) J. Clin. Invest. 95:341-349); 
30 peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
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(Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 12:299- 
306; Jones, et al. (1995) Vaccine 13:675-681); peptide compositions contained in immune 
stimulating complexes (ISCOMS) (Takahashi, et al (1990) Nature 344:873-875; Hu, et al. 
(1998) Clin. Exp. Immunol. 1 13:235-243); multiple antigen peptide systems (MAPs) (Tarn 
5 (1988) Proc. Nat'l Acad. Sci. USA 85:5409-5413; Tarn (1996) J. Immunol. Methods 196:17- 
32); peptides formulated as multivalent peptides; peptides for use in ballistic delivery 
systems, typically crystallized peptides, viral delivery vectors (Perkus, et al. in Kaufmann 
(ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. (1986) Nature 
320:535-537; Hu, et al. (1986) Nature 320:537-547; Kieny, et al. (1986) AIDS . 

10 Biotechnology 4:790; Top, et al. (1971) J. Infect. Pis. 124:148-154; Chanda, et al. (1990) 
Virology 175:535-547), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) J. 
Immunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al. 
(1995) Nature Med. 7:649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 
4:369-388; Gupta, et al. (1993) Vaccine 1 1 :293-306), liposomes (Reddy, et al. (1992) L 

15 Immunol 148:1585-1589; Rock (1996) Immunol. Today 17:131-137), or, naked or particle 
absorbed cDNA (Ulmer, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993) 
Vaccine 1 1 :957-960; Shiver, et al. in Kaufmann (ed. 1996) Concepts in Vaccine 
Development de Gruyter; Cease and Berzofsky (1994) Annu. Rev. Immunol. 12:923-989; 
and Eldridge, et al. (1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies, 

20 also known as receptor mediated targeting, such as those of Avant Immunotherapeutics, Inc., 
Needham, MA, may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 ; Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 

30 tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
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polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
5 encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465-1468 as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

10 cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 

15 vaccinia virus, e.g., as a vector to express nucleotide sequences that encode bladder cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 

20 described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
like. See, e.g., Shata, et al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) h 
Leukoc. Biol. 68:793-806; Hipp, et al. (2000) In Vivo 14:571-85. 

25 Methods for the use of genes as DNA vaccines are well known, and include placing a 

bladder cancer gene or portion of a bladder cancer gene under the control of a regulatable 
promoter or a tissue-specific promoter for expression in a bladder cancer patient. The 
bladder cancer gene used for DNA vaccines can encode full-length bladder cancer proteins, 
but more preferably encodes portions of the bladder cancer proteins including peptides 

30 derived from the bladder cancer protein. In one embodiment, a patient is immunized with a 
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DNA vaccine comprising a plurality of nucleotide sequences derived from a bladder cancer 
gene. For example, bladder cancer-associated genes or sequence encoding sub fragments of a 
bladder cancer protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
5 responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the bladder cancer polypeptide encoded by the DNA vaccine. 

10 Additional or alternative adjuvants are available. 

In another preferred embodiment bladder cancer genes find use in generating animal 
models of bladder cancer. When the bladder cancer gene identified is repressed or 
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
to the bladder cancer gene will also diminish or repress expression of the gene. Animal 

15 models of bladder cancer find use in screening for modulators of a bladder cancer-associated 
sequence or modulators of bladder cancer. Similarly, transgenic animal technology including 
gene knockout technology, e.g., as a result of homologous recombination with an appropriate 
gene targeting vector, will result in the absence or increased expression of the bladder cancer 
protein. When desired, tissue-specific expression or knockout of the bladder cancer protein 

20 may be necessary. 

It is also possible that the bladder cancer protein is overexpressed in bladder cancer. 
As such, transgenic animals can be generated that overexpress the bladder cancer protein. 
Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 

25 determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of bladder cancer and are 
additionally useful in screening for modulators to treat bladder cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

90 



WO 03/003906 



PCT7US02/21338 



For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In the diagnostic and research applications such kits may 
include one or more of the following: assay reagents, buffers, bladder cancer-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense or inhibitory 
5 polynucleotides, ribozymes, dominant negative bladder cancer polypeptides or 

polynucleotides, small molecules inhibitors of bladder cancer-associated sequences etc. A 
therapeutic product may include sterile saline or another pharmaceutically acceptable 
emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions (e.g., 

1 0 protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 
capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 

1 5 media may include addresses to internet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of bladder 
cancer-associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a 
bladder cancer-associated polypeptide or polynucleotide, control positive or negative 

20 samples, reaction tubes, and instructions for testing bladder cancer-associated activity. 

Optionally, the kit contains biologically active bladder cancer protein. A wide variety of kits 
and components can be prepared according to the present invention, depending upon the 
intended user of the kit and the particular needs of the user. Diagnosis would typically 
involve evaluation of a plurality of genes or products. The genes will be selected based on 

25 correlations with important parameters in disease which may be identified in historical or 
outcome data. 
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EXAMPLES 

Example 1 : Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and analyzed 
using gene chips. RNA was isolated and gene chip analysis was performed as described 
5 (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981-993). 



TABLE DESCRIPTIONS 
Table 1 A shows about 3413 that exhibit increased or decreased expression in bladder cancer 
10 samples. See USSN 60/302,814. 

Table 2A shows about 485 genes overexpressed in bladder tumors relative to normal tissues 
as analyzed using the Affymetrix/Eos Hu03 GeneChip array. See USSN 60/343,705. 

15 Table 3 A shows about 414 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as small molecule, antibody, DNA vaccine targets for the 
therapy of bladder cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from 
this analysis was expressed as average intensity (Al), a normalized value reflecting the 

20 relative level of mRNA expression. 

Table 4A shows about 129 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as diagnostics of bladder cancer. These genes were selected 
from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for 
25 each probeset obtained from this analysis was expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Table 5 A shows about 149 genes upregulated in bladder cancer relative to normal body 
tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
30 Genechip array. Gene expression data for each probeset obtained from this analysis was 

expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

Table 6 A shows about 199 genes upregulated in bladder cancer relative to normal bladder 
35 tissue. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
Genechip array. Gene expression data for each probeset obtained from this analysis was 
expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

40 Table 7 A shows about 63 genes downregulated in bladder tumors relative to normal bladder. 
These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip 
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array. Gene expression data for each probeset obtained from this analysis was expressed as 
average intensity (AI), a normalized value reflecting the relative level of mRNA expression. 

Table 8A shows about 1440 genes upregulated in Ta or Tl bladder tumors from patients who 
5 later presented with muscle-invasive bladder tumors (stage T2-T4). Gene expression data for 
each probeset obtained from this analysis was expressed as average intensity (AI), a 
normalized value reflecting the relative level of mRNA expression. 

Table 9 A shows about 1200 genes upregulated in Ta or Tl tumors of patients who later 
10 presented with either more Ta tumors or no tumors at all. Gene expression data for each 
probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 10A shows about 65 genes upregulated in non-invasive exophytic Ta bladder tumors 
1 5 relative to ! T2-T4 muscle-invasive tumors. Gene expression data for each probeset obtained 
from this analysis was expressed as average intensity (AI), a normalized value reflecting the 
relative level of mRNA expression. 

Table 1 1 A shows about 106 genes upregulated in muscle-invasive T2-T4 bladder tumors 
20 relative to non-invasive exophytic Ta bladder tumors. Gene expression data for each 

probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 12A shows the Pkey, ExAccn, UnigenelD, and Unigene Title for all of the sequences 
25 in Table 13. Seq ID No. is used to link Table 12A to table 13. 

Tables 1B-12B show the accession numbers for those Pkey's lacking UnigenelD's for tables 
1 A -12A, respectively. For each probeset is listed a gene cluster number from which 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
30 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

35 Tables 1C-12C show genomic positioning for Pkey's lacking Unigene ID's and accession 
numbers for tables 1 A -12 A, respectively. For each predicted exon, is listed genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

40 T&B1E1A; 



Pkay: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

45 Unigene Title: Unigene gene title 

R1: please refer to original application 

R£ please refer to original application 

Target Type: downregutate stage If target is downregulated in bladder tumors relative to normal bladder or 
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early stage tf target Is an earl stage (Ta) bladder tumor marker or 
lata stage if target Is a late stage (T2-T4) bladder tumor marker or 
T2-T4 grade 3 papilloma marker or T2-T4 grade 3 solid tumor marker or Upregulate stage 
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Pkey 


ExAccn 


UnigenelD 


Urugerte Title 


R1 


R2 


Target Type 


400440 


X83957 


Hs.83870 


nebutin 


0L17 


205 


downregulate stage 


400888 








a24 


1.97 


downregulate stage 


401566 








0.19 


4.06 


downf egulate stage 


401669 








0.2 


205 


downregulate stage 


401691 








0.04 


10.13 


downregulate stage 


401905 








0.3 


1.87 


downregulate stage 


402076 








0.06 


6.51 


downregulate stage 


402110 








0.43 


235 


downregulate stage 


402271 


NM.002197 


Hs.1 54721 


aconitase 1, soluble 


0.21 


216 


downregulate stage 


403362 






0.18 


4.44 


downregulate stage 


403687 








0.32 


1.91 


downregulate stage 


403959 








0.14 


227 


downregulate stage 


404015 
404059 








0.2 
0.36 


248 
1.84 


downregulate stage 
downregulate stage 


404152 








0.32 


1.85 


downregulate stage 


404498 








0.14 


218 


downregulate stage 


404819 








0.19 


5.25 


downregulate stage 


405001 


U58196 


Hs.296281 


Interleukin enhancer binding factor 1 


0.16 


2.92 


downregulate stage 


405349 






0.16 


3.8 


downregulate stage 


405390 








0.3 


254 


downregulate stage 


405735 








0.13 


244 


downregulate stage 


405968 








0.26 


1.85 


downregulate stage 


406017 








0.32 


228 


downregulate stage 


406305 


BE261320 


Hs.297096 


transcriptional adaptor 3 (ADA3, yeast h 


0.42 


1.93 


downregulate stage 


406320 






0.37 


201 


downregulate stage 


406704 


M21665 


Ks.929 


myosin, heavy polypeptide 7, cardiac mus 


0.3 


284 


downregulate stage 


406706 


X03740 


Hs.231581 


myosin, heavy polypeptide 1, skeletal mu 


0.14 


7.4 


downregulate stage 


406707 


S73840 


Hs.931 


myosin, heavy polypeptide 2, skeletal mu 


0.05 


1251 


downregulate stage 


407013 


U35637 




gb:Human nebulin mRNA, partial cds 
titin 


0,14 


217 


downregulate stage 


407245 


X90568 


Hs.172004 


0.02 


15.21 


downregulate stage 


407330 


AA582607 




gb:nn51b05.s1 NCLCGAPJ<id6 Homo sapiens 


0.42 


1.87 


downregulate stage 


407571 


A1446183 


Hs.9572 


ESTs 


0.38 


213 


downregulate stage 


407682 


AL035858 


Hs.3807 


FXYD domain-containing Ion transport reg 


0.34 


256 


downregulate stage 


407815 


AW373860 


Hs.301716 


ESTs 


0.31 


244 


downregulate stage 


407834 


AW084991 


Hs.2610O 


ESTs 


0.15 


298 


downregulate stage 


407891 


AA486620 


Hs.41135 


endomucin-2 


0.15 


3.33 


downregulate stage 


407906 


AA369665 


Hs.41185 


Homo sapiens mRNA; cDNA DKFZp56401262 (f 


0.12 


8.05 


downregulate stage 


407938 


AA905097 


Hs.85050 


phosphoJamban 


0.08 


8.77 


downregulate stage 


407965 


W21483 


Hs.41707 


heat shock 27kD protein 3 


0.26 


229 


downregulate stage 


408009 


AF020498 


Hs.41735 


purinergic receptor P2X, ligand-gated to 


0.49 


1.91 


downregulate stage 


408139 


AA451966 


Hs.43005 


RAB94ike protein 


0.41 


1.88 


downregulate stage 


408221 


AA912183 


Hs.47447 


ESTs 


0.04 


24.1 


downregulate stage 


408374 


AW025430 


Hs.155591 


forkhead box F1 


0.35 


285 


downregulate stage 


408493 


BE206854 


Hs.46039 


phosphoglycerate mutase 2 (muscle) 


0.09 


9.04 


downregulate stage 


408508 


A1806109 


Hs.135736 


KIAA15B0 protein 


0.45 


22 


downregulate stage 


408614 


AL137698 


Hs.46531 


Homo sapiens mRNA; cDNA DKFZp434C1915 (f 


0.13 


5.48 


downregulate stage 


408652 


R43409 


Hs.6829 


ESTs 


0.33 


25 


downregulate stage 


408753 


A1337192 


Hs.47438 


SH3 domain binding glutamic acid-rich pr 


0.05 


6.94 


downregulate stage 


408896 


AI610447 


Hs,48778 


niban protein 


0.39 


201 


downregulate stage 


409277 


T05558 


Hs.156880 


ESTs 


0.18 


275 


downregulate stage 


410023 


AB017169 


Hs.57929 


slit (Drosophila) homotog 3 


0.07 


2.7 


downregulate stage 


410036 


R57171 


Hs.57975 


calsequestrin 2, cardiac muscle 


0.09 


5.23 


downregulate stage 


410132 


NM.003480 


HS.5B882 


MicrofibrO-associated giycoproteln-2 


0.24 


234 


downregulate stage 


410168 


AW834050 


Hs.9973 


tensin 


0.39 


217 


downregulate stage 


410243 


D83402 


Hs.289006 


ESTs, Weakly similar to alternatively sp 


0.11 


282 


downregulate stage 


410339 


AI916499 


Hs.298258 


ESTs 


0.15 


216 


downregulate stage 


410677 


NWL003278 


Hs.65424 


tetranectin (plasmtnogen-binding protein 


0.32 


265 


downregulate stage 


410868 


T06529 


Hs.98518 


Homo sapiens cONA FU1 1490 fls, clone HE 


0.2 


274 


downregulate stage 


411048 


AK001742 


Hs.67991 


hypothetical protein DKFZp434G0522 


0.2 


1.92 


downregulate stage 


411067 


A1581006 


Hs.301543 


ESTs 


0.11 


3.41 


downregulate stage 


411069 


AL133092 


Hs.68055 


hypothetical protein OKFZp434l0428 


0.17 


5.8 


downregulate stage 


411544 


H92064 


Hs.301548 


ESTs 


0.08 


13.8 


downregulate stage 


411741 


AW859650 




gb:RC0-CT0358-071299-01 1-d03 CT035B Homo 


0.36 


25 


downregulate stage 


412047 


AA934589 


Hs.49696 


ESTs 


0.18 


3.57 


downregulate stage 


412095 


AI524707 


Hs.5921 


Homo sapiens cDNA: FU21592 fts, clone C 


0.32 


1.89 


downregulate stage 


412389 


AW947655 




gb:ROWOT0003-140300-031-b07 MT0003 Homo 


0.38 


26 


downregulate stage 


412442 


AI983730 


Hs.26530 


serum deprivation response (phosphatidyl 


0.12 


167 


downregulate stage 


412519 


AA196241 


Hs.73980 


troponin T1, skeletal, slow 


0.24 


1.86 


downregulate stage 


412622 


AW664708 


Hs.171959 


ESTs 


0.06 


5.45 


downregulate stage 


412649 


NM.002206 


Hs.74369 


integrln, alpha 7 


0.29 


295 


downregulate stage 
downregulate stage 


412659 


AW753865 


Hs.74376 


olfactomedin related ER localized protei 


0.18 


206 


412758 


Y07818 


Hs.74566 


dihyatopyriniidinase-tta 3 


0.3 


223 


downregulate stage 
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U41518 


Hs.74602 


aquaporin 1 (channel-forming Integra) pr 


U.ll 


0 71 
LI \ 


down regulate stage 


412975 


T70956 


Hs.75106 


dusterin (complemenl lysis inhibitor, S 


0.44 


2.03 


downregulate stage 


413074 


AI871368 


Hs.8417 


ESTs 


0.47 


1.91 


down regulate stage 


413272 


AA1 27923 


Hs. 293256 


ESTs 


0.(3 


4,44 


downregidatB stage 


413276 


Z24725 


Hs.75260 


rrtftogon inducible 2 


0.23 


Z48 


downregulate stage 


413508 


BE145364 




gb;ILO-HT0 198-1 51 099-125*05 HT0198 Homo 


0.31 


2.53 


downregulate stage 


413624 


BE177019 


HsJ5445 


S PARC-like 1 (masty, nevtn) 


0.33 


2.17 


downregulate stage 


413778 


AA090235 


Hs.75535 


myosin, light polypeptide 2, regulatory, 


0.33 


2.63 


downregulate stage 


414063 


H26904 


Hs.75736 


apoBpoprotfiin D 


0.42 


1.85 


downregulate stage 


414241 


AA425085 


Hs.4007 


Sarcdemmal-assoctated protein 


0.16 


2.22 


downregulate stage 


414290 


AI568801 


Hs.71721 


ESTs 


0.02 


10 


downregulate stage 


414629 


AA34&24 


HsJ6588 


carboxylesterase 1 (monocyte/macrophage 


0.13 


4.14 


downregulate stage 


414657 


AA424074 


Hs.76780 


protein phosphatase 1 , regulatory (inhib 


0.33 


2,14 


downregulate stage 


414712 


N88858 


Hs.77039 


ribosomal protein S3A 


0.4 


2.5 


downregulate stage 


414903 


M451700 


Hs.85835 


Homo sapiens cDNA: FU22341 fis, clone K 


0.3 


3.3 


downregulate stage 


415165 


AW887604 


Hs. 78065 


complemenl component 7 


0.04 


3.41 


downregulate stage 


415274 


AF001548 


Hs.78344 


myosin, heavy polypeptide 11, smooth mus 


0.2 


3.29 


downregulate stage 


415447 


Z97171 


Hs.78454 


myociPn, trabecular meshwork inducible 


0.15 


6.55 


downregulate stage 


415672 


N53097 


Hs.1 93579 


ESTs 


0.28 


3.55 


downregulate stage 


415934 


NM_000928 


Hs.992 


phospholipase A2, group IB (pancreas) 


0.34 


2.64 


downregulate stage 


416127 


N49843 


Hs.79022 


GTP-bindlng protein overexpressed in ske 


0.3 


1.98 


downregulate stage 


416349 


X69089 


Hs.79227 


myomesin (M-proteln) 2 (1651(D) 


0.41 


1.96 


downregulate stage 


416585 


X54162 


Hs,79386 


teiomodin 1 (smooth muscle) 


0.02 


49,3 


downregulate stage 


416854 


H40164 


Hs.80296 


Purkinje ceD protein 4 


0.02 


7.55 


downregulate stage 


416941 


BE000150 


Hs.48778 


niban protein 


0.27 


2.16 


downregulate stage 


416982 


J05401 


Hs.80691 


creatine kinase, mitochondrial 2 (sarcom 


0.29 


2.43 


downregulate stage 


417011 


F08212 


Hs. 234898 


ESTs 


0.41 


2.06 


downregulate stage 


417298 


AW565339 


Hs.37958 


ESTs 


0.27 


3.7 


downregulate stage 


417501 


AL041219 


Hs.82222 


sema domain, immunoglobulin domain (Ig), 


0.39 


2.08 


downregulate stage 


417553 


L09190 


Hs.82276 


trichohyafn 


0.29 


2.59 


downregulate stage 


417987 


AA210872 


Hs.50133 


ESTs 


0.22 


2.09 


downregulate stage 


418297 


R91254 




gb:yp94e12.s1 Soares fetal liver spieen 


0.28 


1.9 


downregulate stage 


418332 


R34976 


HS.7B293 


ESTs 


0.2 


3.9 


downregulate stage 


418391 


NMJJ03281 


Hs.84673 


troponin I, skeletal, stow 


0.35 


2.02 


downregulate stage 


418409 


AA219332 


Hs, 120869 


ESTs, Weakly similar to AF092922 1 retin 


0.21 


3.8 


downregulate stage 


418421 


R58620 


Hs.85050 


phospholamban 


0.2 


2.08 


downregulate stage 


418489 


U76421 


Hs.85302 


adenosine deaminase, RNA-specific, B1 (h 


0.05 


21.55 


downregulate stage 


418533 


NMJ)04533 


Hs.85937 


myosin-binding protein C, fast-type 


0.42 


1.95 


downregulate stage 


418787 


AW296134 


Hs.86999 


ESTs 


0.48 


1.87 


downregulate stage 


418793 


AW3829B7 


Hs.88474 


prostaglandin-endoperoxide synthase 1 (p 


0.26 


243 


downregulate stage 


418947 


W52990 


Hs.22860 


ESTs 


0.13 


7.18 


downregulate stage 


419037 


R39895 


Hs.7864 


ESTs 


0.27 


2 


downregulate stage 


419441 


AW023731 


Hs.274368 


Homo sapiens mRNA; cDNA DKFZp586l1524 (f 


0.19 


5.25 


downregutate stage 


419535 


AW1 39550 


Hs.1 15173 


ESTs 


0.31 


2.59 


downregulate stage 


419682 


H13139 


Hs.92282 


paired-like homeodomain transcription fa 


0.28 


238 


downregulate stage 


419665 


W76083 


Hs.1 73077 


ESTs 


0.4 


2.21 


downregulate stage 


419703 


AI793257 


Hs.1 281 51 


ESTs 


0.09 


3.52 


downregulate stage 


419942 


U25138 


Hs.93841 


potassium large conductance caiciunvacti 


0.28 


2.96 


downregulate stage 


420058 


AK001423 


Hs.94694 


Homo sapiens cDNA FU10561 fis, done NT 


0.3 


2.09 


downregulate stage 


420195 


N44348 


Hs.300794 


ESTs 


0.22 


2.79 


downregulate stage 


A2026i 


AW206093 


Hs.74B 


fibroblast growth factor receptor 1 (fms 


0.35 


1.98 


downregulate stage 


420674 


NMJJ00O55 


Hs.1 327 


butyrylcholinesterase 


0.29 


3.5 


downregulate stage 


421296 


NMJJ02666 


Hs.103253 


perHtpin 


0.36 


2.11 


downregulate stage 


421639 


NO12082 


Hs.297921 


Homo sapiens mRMA full length Insert cDN 


0.13 


4.3 


downregulate stage 


421763 


AW163500 


Hs.1 08080 


cysteine and glycine-dch protein 1 


0.26 


3.49 


downregutate stage 


421853 


AL1 17472 


Hs.108924 


DKFZP586P1422 protein 


0.14 


5 


downregulate stage 


422103 


AA984330 


Hs.1 11676 


protein kinase H1 1 ; small stress protein 


0.2 


2.29 


downregulate stage 


422287 


F16365 


Hs. 114346 


cytochrome c oxidase subunit Vila polype 


0.27 


2.58 


downregulate stage 


422320 


AI745249 


Hs.23650 


ESTs, WeaWy similar to AAB47496 NG5 [H. 


0.24 


2.95 


downregulate stage 


422633 


X56832 


Hs.1 18804 


enofase 3, (beta, muscle) 


0.23 


3.67 


downregulate stage 


422639 


AI929377 


Hs.1 73724 


creatine kinase, brain 


0.39 


1.97 


downregulate stage 


423334 


AK000906 


Hs.1 27273 


hypothetical protein FL) 10044 


0.37 


2.29 


downregulate stage 


423587 


AA328074 


Hs.284256 


hypothetical protein FU 14033 similar to 


0.37 


2.47 


downregulate stage 




AL035447 


no. lOHQTf 


hvnnthatfral nmlpin 1 fif 
nypuuiBucoi proiein lvaa}( i-jo 


0.24 


2.43 


rtawnreniitata <ttana 
uvwiucyuioio oiayo 


424161 


AL039432 


Hs.142517 


Homo sapiens mRNA; cDNA DKFZp434F0810 (f 


0*27 


228 


downregutate stage 


424206 


NM.003734 


Hs.198241 


amine oxidase, copper containing 3 (vase 


0.3 


2.59 


downregulate stage 


424479 


AF064238 


Hs.149098 


smoothelin 


0.26 


3.29 


downregulate stage 


424580 


AA446S39 


Hs.35092 


ESTs 


0.15 


2.57 


downregulate stage 


424846 


AU077324 


Hs.1832 


neuropeptide Y 


0.4 


2.04 


downregulate stage 


424938 


AW102607 


Hs.245233 


ESTs 


0.29 


2.16 


downregulate stage 


424982 


U94777 


Hs.154084 


phosphorytase, glycogen; muscle (McArdle 


0.42 


1.89 


downregulate stage 


425383 


D83407 


Hs.156007 


Down syndrome critical region gene 1-lik 


0.14 


1.86 


downregulate stage 


425545 


N98529 


Hs.158295 


Human mRNA for myosin light chain 3 (MIX 


0.03 


13.25 


downregulate stage 


425622 


AW380847 


Hs.16578 


ESTs 


0.3 


2.19 


downregutate stage 


425751 


T19239 


Hs.1940 


crystaliin, alpha B 


0.47 


1.92 


downregulate stage 


425869 


AA524547 


Hs.160318 


FXYD domain-containing ion transport reg 


0.45 


1.85 


downregulate stage 


425887 


AL049443 


Hs.161283 


Homo sapiens mRNA; cDNA DKFZp586N2020 (f 


0.19 


285 


downregutate stage 
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425932 


M81650 


Hs.1 968 


426354 


NM.004010 


Hs.169470 


425429 


X73114 


Hs,1 69849 


426752 


X69490 


Hs.172004 


426809 


BE313114 


Hs.29706 


427078 


AI676062 


Hs.111902 


427136 


AL117415 


Hs.173716 


427164 


AB037721 


Hs.173871 


4271 B5 


AA398930 


Hs.1B6674 


427373 


AB007972 


Hs.177533 


427393 


AB029018 


Hs.177635 


427665 


AF134603 


Hs.180141 


427676 


AA394062 


Hs.180266 


427888 


AA4170B8 


Hs.137598 


427980 


AA418305 




428087 


AA100573 


Hs.182421 


428138 


AA773842 


Hs.293799 


428221 


U96781 


Hs.183075 


428329 


AA426091 


Hs.98453 


428409 


AW117207 


Hs.98523 


428411 


AW291464 


Hs.10338 


428648 


AF052728 


Hs.188021 


428649 


AL045716 


Hs.188228 


428899 


AA744810 


Hs.194431 


429350 


AI754634 


Hs.1 31987 


429525 


N92540 


Hs.205353 


429545 


AS824164 


Hs.77667 
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6.2 


0.11 


upregulate stage 


410562 


AW858528 




-L.AIJa />TA9 J 1 *r"/^*>rtA 4)4f% 1*4)4 /VTAOJ4 11- . . - 

gb:CM3-CT0341-150300-119-n11 CT0341 Homo 


1 


1 


upregulate stage 


410579 


AK001628 


Hs.64691 


KIAA0483 protein 


11.1 


0.06 


upregulate stage 


410634 


AW888653 


Hs.265859 


ESTs 


1 


1 


upregutate stage 


410664 


NM_006033 


Hs.65370 


lipase, endothelial 


3.95 


0.1 


upregulate stage 


410668 


BE379794 


Hs.65403 


hypothetical protein 


1.82 


0.41 


upregulate stage 


410730 


AW368860 


Hs.293950 


ESTs 


9.25 


0.07 


upregutate stage 


410751 


AA357918 




gb:EST66726 Fetal lung Hi Homo sapiens 


1 


1 


upregulate stage 


410754 


T63840 


• 


gb^c16b10.s1 Stratagene lung (937210) H 


3.1 


0.14 


upregulate stage 


410762 


AF226053 


Hs.66170 


HSKM-B protein 


5.55 


0.1 


upregulate stage 


410764 


AW978159 


Hs.250164 


ESTs, WeaWy similar to coded for by C. 


1 


0.2 


upregulate stage 


410782 


AW504860 


Hs.26B836 


Homo sapiens cDNA FU 12673 fis, clone NT 


1.75 


0.25 


upregutate stage 


410794 


AA248010 


Hs. 154669 


ESTs 


1 


0.67 


upregulate stage 


410804 


U64820 


Ks.66521 


Machado-Joseph disease (spinocerebellar 


3.2 


0.17 


upregulate stage 


410844 


AW807073 




gb:MR4-ST0062-0311 99-01 8-d06 ST0062 Homo 


1 


0.8 


upregutate stage 


410855 


X97795 


Hs.66718 


RAD54 (S.cerevisiaeH^e 


6.5 


0.12 


upregulate stage 


410910 


AW810204 




A.imi ATAir\F AMiinn f\ 4~r jnn r«rr\ 11 

gb:MR4-ST01 25-021 1 99-01 7-d08 ST0125 Homo 


9.35 


0.08 


upregulate stage 


410973 


AW812278 




gb:RC0-ST01 74-21 1099-01 1-h12 ST0174 Homo 


1 


1 


upregutate stage 


410976 


R36207 


Hs.25092 


ESTs 


6.35 


0.1 


upregulate stage 


410997 


AW812877 




gb:RC3-ST01 86-3001 00-01 7-e04 ST0186 Homo 


1 


1 


upregulate stage 


410998 


W28247 


Hs.82007 


EOAA0094 protein 


2.45 


0.18 


upregulate stage 


41 1036 


AA857218 


Hs.297007 


ESTs 


4.05 


0.14 


upregulate stage 


411110 


K93D00 




gb^v07f01.s1 Soares fetal liver spleen 


1 


0.36 


upregulate stage 


411132 


AW819191 




0b:CM1-STu2B3-O71299-O51-OO8 ST02B3 Homo 


1 


1 


upregulate stage 


411137 


AWB19455 




gb:RC5-ST0293-021 299-03 1-WJ4 ST0Z93 Homo 


3.65 


0.18 


upregulate stage 


411157 


AWB19867 




a L,AiM CTA1fl4 ft'TAOAft 4C4 fni OTAOOa Lliuaii 

gb:QV0-ST0294-0703Ol)-151-tU2 51 0294 Homo 


3.2 


0.2 


upregulate stage 


411159 


AWB20178 




go:QV0*ST0294-1 00400-185-607 ST0294 Homo 


1 


0.27 


upregulate stage 


411170 


AW820503 

r*»» utu JUJ 




ob , OV2>ST0298-140200-042-b05 ST0298 Homo 


1 


1 




411193 


AW821484 




Qb:lL2-ST031 1-211 299-028-F12 ST031 1 Homo 


1 


0.24 


upregulate stage 


411242 


BE146808 




gb:QV4-HT0222-181099-013-g03 HT0222 Homo 


2.55 


0.26 


upregulate stage 


411245 


AWB33441 




gb:QV4-F0008-2710994)2f>g01 TT0008 Homo 


9.62 


0.09 


upregulate stage 


411263 


BE297802 


Hs.69360 


klnesin-like 6 (mitotic centromere-assoc 


14 


0.32 


upregulate stage 


411282 


AW995011 




gb:QVO-BN0040-170300-161-d07BN0040 Homo 


1 


1 


upregulate stage 


411284 


N28519 


Hs.135191 


ESTs, WeaWy similar to unnamed protein 


3.25 


0.12 


upregulate stage 


411294 


AW859729 


Hs.42680 


ESTs 


1 


1 


upregutate stage 


411327 


AW836922 




gb:QVUTO(»6-150200-074-h06 LT003S Homo 


1 


0.37 


upregulate stage 


411338 


AW731782 


Hs.116122 


ESTs, Weakly similar to unnamed protein 


5 


0.13 


upregulate stage 


411339 


BE164598 




gb:RC3-HT0470-12020(W)13^10 HT0470 Homo 


1 


0.25 


upregulate stage 


411383 


AA001394 


Hs.69749 


WAA0087 gene product 


3.6 


0.18 


upregutate stage 


411387 


AW842339 


Hs.130815 


hypothetical protein FU21870 


8.75 


0.09 


upregutate stage 


411400 


AA311919 


Hs.69851 


GAR1 protein 


12.1 


0.07 


upregulate stage 
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111425 


AW846012 




gD:Rw«C10lb>2oU333-UU>cUi oiuioonomo 


1 
i 


ft 7 A 


upreguiate stage 


AHAC4 

411461 


AW847937 




gD;fw-CTUii3-21u2BU-U42-uu2 uiU2io nomo 


4 
1 


4 

I 


upreguiate stage 


411326 


AtAIOCAOO? 




_w.ti o. nTAo*o_ooiiGQ-ftoo_fY\R PTft9iQU(snv\ 
gDML>CI021^221iyw2b-UU0 UlUZlb nomo 


i 
1 


4 

1 


uprey uiaio siogs 




AWoOllbO 




gDUL>CTU22U*lw2llU-Un-nU0 UIUZ2U nomo 




n 17 


upieginaiB siago 


411568 


OC4 A/Ct\1 

BE144593 




gj):MKD-HTUlbM4uW-UU2HU4 MIUiO/ nomo 


4 

1 


4 

1 


upreguiate stage 


411571 


A A 4 *W*fW 

AA1 22393 


Hs.70811 


i. . . _ _ it- - it — I . - 1 . * . CI 10AE4C 

hypothetical prolan FU2Q516 


J. bo 


ft 4A 

U.14 


upreguiate stage 


411605 


AW005831 


lt_ Mi7f1 

Hs.20479 


ESTs 


3.0 


ft no 


upreguiate stage 


411626 


A \ AfTO t A CO 

AW793453 


Hs71109 


I/I A A 1 *>1ft ...U. 

KIAA1229 proteui 


4 

1 


1 


upreguiaie stage 


411630 


U42349 


Hs.71119 


Putative prostate cancer tumor suppresso 


A 4 

4.1 


0.11 


upreguiate stage 


411643 


AI924519 


Hs. 192570 


Homo sapiens cDNA FU22028 fis, clone H 


4 
1 


0.28 


upreguiate stage 


411653 


AF070578 


Hs71 168 


Homo sapiens clone 24674 mRNA sequence 


0.9 


U.Uo 


upreguiate stage 


411727 


At imrni a n 

AW858443 




gb:CMr>CT0341-26010Q-16W10CT0341 Homo 


1 


1 


upreguiate stage 


411771 


AW993247 




gD:KW-BN0Qo3- ioU2WMn4-ftU3 BNwoO nOntO 


2.0 


ft iA 

U.14 


upreguiaie stage 


411787 


AW863558 




PIIAA4A tJAOAA 4A1 _«n PMOfHA 

gb:MR3-SN001 0-240300- 102-c 10 5N0010 Homo 


1 


1 


upreguiate stage 


411788 


AW897793 




_ L, , r>» 1 4 MMfiftM OQAAftft OflO fPV7 kltiflflCO Llnmn 

gD:UM1-NNUuo3-2oU4UU-2U3HUr NNUUoo nomo 


0.7 


ft 1C 


upreguiate stage 


411826 


A tAtrx J^rt A£* 

AW947946 




gb:Pr\^MT0011-24O30O>00 1-309 MT0011 Homo 


3.25 


O.id 


upreguiate stage 


411835 


U29343 


Hs.72550 


hyaluronarwnediated motility receptor (R 


4 
1 


4 
1 


upreguiate stage 


41 1860 


T89420 




gb:yd98f04.s1 Scares fetal liver spleen 


1 


0.22 


upreguiate stage 


411874 


AA096106 


Hs.20403 


ESTs 


5.75 


0.11 


upreguiate stage 


411917 


AW876360 


Hs.3592 


Homo sapiens cDNA: FU 22555 fis, clone H 


1 


0.33 


upreguiate stage 


41 1928 


AA888624 


Hs.19121 


adaptor-related protein complex 2, alpha 


4.75 


n io 
U.12 


upreguiate stage 


411932 


AW876548 




gb:RC3-PT0028-1 901 00-01 2-h02 PT0028 Homo 


1 


0.38 


upreguiate stage 


411943 


BE502436 


Hs79S2 


ESTs, Weakly similar to putative [Ceteg 


3.82 


0.23 


upreguiate stage 


411945 


AL033527 


Hs.92137 


v-myc avian myelocytomatosis viral oncog 


4.65 


0.15 


upreguiate stags 


411991 


X58822 


Hs73010 


interferon, omega 1 


Z45 


0.14 


upreguiate stage 


412040 


D86519 


Hs73086 


neuropeptide Y receptor Y6 (pseudogene) 


4.6 


0.14 


upreguiaie stage 


412088 


A1689496 


Hs.108932 


ESTs 


2.82 


0.18 


upreguiate stage 


412134 


AW8955B0 




gb:QV4-NN0038-27fJ4o0-187-g08 NN0Q38 Homo 


6.4 


0.1 


upreguiate stage 


412140 


AA219691 


Hs73625 


RAB5 Interacting, kinesin-Bke (rabkines 


17.05 


0.04 


upreguiate stage 


412231 


AW902491 


Hs.289088 


heat shock 90kO protein 1, alpha 


1 


0.91 


upreguiate stage 


412296 


AW935233 




gb:QVO-DT0020-090200-107-a06 DT0020 Homo 


1 


1 


upreguiate stage 


412327 


AW937355 




gb:QV3-DT0043-21 1299-044-aOo DT0043 Homo 


1 


1 


upreguiaie stage 


412357 


AW939537 




gb:QV1-DT0072-1 10200-066-f05 DT0072 Homo 


1 


0.24 


upreguiate stage 


412359 


AW837985 




gb:QV3-LT0048-14020 f >083^05 LT0048 Homo 


1 


0.41 


upreguiate stage 


412367 


AW945964 




gb:avO-ET0001 -050500- 228-609 cTOUUl Homo 


1 


n oo 
U.22 


upreguiate stage 


412529 


BE271224 


Hs.266273 


Homo sapiens cONA FU 13346 fis, clone OV 


4.45 


0.14 


upreguiaie stage 


412530 


AA766268 


Hs.266273 


Homo sapiens cDNA FU 13346 fis, clone OV 


9.3 


0.08 


upreguiate stage 


412537 


AL031778 


Hs797 


nuclear transcription factor Y, alpha 


4.2a 


ft AA 

U. 14 


upreguiate stage 


412547 


W27161 




gb:23a12 Human retina cDNA randomly prim 


1 


1 


upreguiate stage 


412559 


T31474 




gb:EST33147 Human Embryo Homo sapiens cD 


1 


0.26 


upreguiate stage 


412636 


NM.004415 


HS74316 


desmoplakin (DPI, DPII) 


12.05 


0.05 


upreguiate stage 


412648 


AA1 15211 


Hs.69658 


EST 


1 


0.28 


upreguiaie stage 


412668 


AA456195 


Hs. 10056 


ESTs 


1075 


0.07 


upreguiate stage 


412671 


AW977734 




gb:EST389963 ^MGE resequences, MAGO Homo • 


2.65 


0.3 


upreguiate stage 


412673 


AL042957 


Hs.31845 


ESTs 


4.6 


0.11 


upreguiate stage 


412723 


a loin j rf\ 

AA648459 


Hs. 1799 12 


ESTs 


2.55 


0.11 


upreguiate stage 


412739 


AA1 16018 


Hs.271809 


Homo sapiens cDNA: FU 22406 lis, clone H 


l.b 


n 04 
U.24 


upreguiate stage 


412744 


N31101 




gb^x52a03.r1 Soares melanocyte 2NbHM Ho 


2 


0.23 


upreguiate stage 


412778 


AA1 20882 


Hs.159244 


ESTs 


1 


1 


upreguiate stage 


412811 


H06382 


1 1. <"n Ann 

Hs.21400 


ESTs 


1 


A Ad 

0.49 


upreguiate stage 


412838 


D61870 




gb:HUM218F11B uontech human aorta polyA 


1 


A OX 

0.J4 


upreguiate stage 


412854 


nmnA 4 An 

BE004149 


Hs.31161 


ESTs 


4 

1 


4 

1 


upreguiate stage 


413075 


D59828 


HS.70953 


ESTs 


1 


ft 77 


upreguiate stage 


413109 


AW389845 


HS.llOooO 


ESTs 


o.yj 


ft i 
U.l 


upreguiate stage 


413117 


BE066107 


Hs. 138484 


ESTs, Weakly similar to ALU1_HUiMAN ALU S 


1 


0.22 


upreguiate stage 


413119 


BE065941 




gb:RC3-BT03l9-1X100-012-d12 BT0319Homo 


1 


0.87 


upreguiate stage 


413141 


BE166323 




gb:QV4-HT0492-270100-086-e12 HT0492 Homo 


5.45 


0.12 


upreguiate stage 


413219 


AA878200 


Hs, 11 8727 


Homo sapiens cDNA FU 13692 fis, clone PL 


254 


0.19 


upreguiate stage 


413228 


AA127518 


Hs. 195870 


ESTs 


1 


1 


upreguiate stage 


413273 


U75679 


Hs75257 


Hairpin binding protein, histone 


5.05 


0.11 


upreguiate stage 


413278 


BE563085 


Hs.833 


interferort-s&muiated protein, 15 kOa 


1.45 


0.5 


upreguiate stage 


413294 


BE144034 




gb:MRO-HT0165-191199-004-a02HT0165 Homo 


1 


1 


upreguiate stage 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


6.95* 


0.03 


upreguiate stage 


JM0140 
41334* 


AAioRROe 






1 

i 


■) 


nnrpntitata ^(ano 
uyi cyuiouj otoyo 


413430 


R22479 


Hs.24650 


Homo sapiens cDNA FU 13047 fis, clone NT 


3 


0.18 


upreguiate stage 


413707 


8E158679 




gb:CM0-HTO395-280100-169^04 HT0395 Homo 


1 


0.28 


upreguiate stage 


413743 


BE1610O4 




gb:PMO-HT0425-17010r>002-h03 HT0425 Homo 


1 


1 


upreguiate stage 


413753 


U17760 


Hs.301103 


Human DNA sequence from done 272L1 6 on 


227 


0.03 


upreguiate stage 


413786 


AW613780 


Hs.13500 


ESTs 


9-9 


0.07 


upreguiate stage 


413792 


BE166924 




gb:CM4-HT05O1-240300-51WOl HT0501 Homo 


1 


1 


upreguiate stage 


413804 


T64682 




gb7c46b02.r1 Stratagene liver (937224) 


0.99 


0.75 


upreguiate stage 


413833 


215005 


HS75573 


centromere protein E (312kD) 


Z55 


0.17 


upreguiate stage 


413854 


BE174300 


Hs.44581 


heat shock protein hsp70-r elated protein 


1.25 


0.24 


upreguiate stage 


413918 


AW015898 


HsJ1245 


ESTs 


4 


0.11 


upreguiate stage 


41396B 


AW5C0374 


Hs.64056 


ESTs 


10.85 


0.07 


upreguiate stage 


414091 


T83742 




gb:yd67g0ls1 Soares fetal liver spleen 


8.9 


0.1 


upreguiate stage 


414099 


U11313 


Hs.75760 


sterol carrier protein 2 


10.3 


0,06 


upreguiate stage 
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414116 


AA587370 


Hs.71584 


ESTs 


1 


1 


upregulate stage 


414127 


AI431863 


Hs.1 35270 


ESTs 


2.85 


0.13 


upregulate stage 


414169 


AA136169 


Hs.149335 


ESTs 


8.95 


0.09 


upregulate stage 


414275 


AW970254 


Hs.889 


Charot-Leyden crystal protein 


7.05 


0.05 


upregulate stage 


414304 


AI621276 


Hs. 165998 


OKFZP564M2423 protein 


1 


0.24 


upregulate stage 


414338 


N80751 


Hs.301471 


ESTs 


10.3 


0.08 


upregulate stage 


414447 


AA147549 


Hs.1 09909 


ESTs 


3.4 


0.16 


upregulate stage 


414494 


AA768491 


Hs.6783 


Homo sapiens cONA: FU22724 fis, done H 


3.4 


0.18 


upregulate stage 


414520 


AA1 48806 


Hs.204046 


ESTs 


1 


0.21 


upregulate stage 


414569 


AF109298 


Hs.1 18256 


prostate cancer associated protein 1 


3.1 


0.18 


upregulate stage 


414575 


H11257 


Hs^95233 


ESTs 


3.1 


0.15 


upregulate stage 


414597 


H67472 


Hs.34274 


ESTs 


4.6 


0.11 


upregulate stage 


414643 


H46177 


Hs.1 19316 


ESTs 


1 


0.28 


upregulate stage 


414658 


X58528 


Hs.76781 


ATP-binding cassette, sub-family D (ALD) 


7.75 


0.08 


upregulate stage 


414661 


T97401 


Hs.21929 


ESTs 


1 


0.26 


upregulate stage 


414683 


S78296 


Hs.76888 


intemexin neuronal intermediate fflamen 


2.72 


0.25 


upregulate stage 


414735 


BE468016 


Hs.281904 


ESTs 


1 


0.36 


upregulate stage 


414737 


Al 160386 


Hs.1 25087 


ESTs 


5.5 


0.1 


upregulate stage 


414747 


U30872 


Hs.77204 


centromere protein F (350/400k0, mitosfn 


3.19 


0.24 


upregulate stage 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


1.45 


0.49 


upregulate stage 


414783 


AW069569 


Hs.75839 


2inc finger protein 6 (CMPX1) 


4.65 


0.13 


upregulate stage 


414799 


AI752416 


Hs.77326 


insufin-Ilke growth factor binding prate 


1.7 


0.46 


upregulate stage 


414833 


T07114 




gb:EST05003 Fetal brain, Stratagene {cat 


4.5 


0.13 


upregulate stage 


414883 


AA926960 


Hs.77550 


COC28 protein kinase 1 


3.36 


0.22 


upregulate stage 


414885 


AA157531 


Hs.269276 


ESTs 


2.7 


0.21 


upregulate stage 


414918 


AJ219207 


Hs.72222 


Homo sapiens cDNA FU 13459 (is, clone PL 


0.87 


0.69 


upregulate stage 


414985 


CI 7372 




gb:C17372 Clontech human aorta polyA+ mR 


1 


0.42 


upregulate stage 


415025 


AW207091 


Hs.72307 


ESTs 


5.3 


0.06 


upregulate stage 


415033 


D31476 


Hs.301448 


Homo sapiens cDNA FU12152 fis, clone MA 


1 


1 


upregulate stage 


415060 


AJ223810 


Hs.43213 


ESTs, Weakly similar to IEFS.HUMAN TRANS 


6.05 


0.1 


upregulate stage 


415068 


Z19448 


Hs.1 31887 


ESTs, Weakly similar to ORF YNL227C (S.c 


4.5 


0.13 


upregulate stage 


415095 


D59592 


Hs.34745 


ESTs 


1 


0.44 


upregulate stage 


415099 


AI492170 


Hs.77917 


ubtquitin carboxyt-terminal esterase L3 


2.27 


0.29 


upregulate stage 


415104 


D60076 




gb:HUM084E1QA Ctontech human fetal brain 


3.95 


0,13 


upregulate stage 


415114 


D50468 




gb:HUM1 1 1 A06B Oontech human fetal brain 


2.05 


0.2 


upregulate stage 


415138 


C18356 


Hs.78045 


tissue factor pathway inhibitor 2 


5.8 


0.05 


upregulate stage 


415139 


AW975942 


Hs.48524 


ESTs 


1.15 


0.21 


upregulate stage 


415148 


Z36953 


Hs.48527 


ESTs 


2.5 


0.2 


upregulate stage 


415153 


C03508 


Hs.7000 


ESTs 


8.95 


0.09 


upregulate stage 


415178 


D80503 




gb:HUM080A02B Human fetal brain (TFujiwa 


1 


0.15 


upregulate stage 


415217 


H23983 


Hs.26922 


ESTs 


1 


0.31 


upregulate stage 


415227 


AW821113 


Hs.72402 


ESTs 


6.3 


0.11 


upregulate stage 


415238 


R37780 


Hs.21422 


ESTs 


1 


1 


upregulate stage 


415241 


F02208 


Hs.27214 


ESTs 


1 


1 


upregulate stage 


415295 


R41450 


Hs.6546 


ESTs 


1 


0.63 


upregulate stage 


415296 


F05086 




gb:HSC01A011 normalized infant brain cDN 


5.65 


0.1 


upregulate stage 


415327 


H22769 


Hs.1 861 


membrane protein, palmitoylated 1 (55kO) 


8.15 


0.09 


upregulate stage 


415330 


Z44693 


Hs.21422 


ESTs 


3 


0.2 


upregulate stage 


415336 


T77664 


Hs.78362 


Human clone 23839 mRNA sequence 


1 


0.87 


upregulate stage 


415337 


Z44881 


Hs.9012 


ESTs 


8.8 


0.07 


upregulate stags 


415352 


F06565 




gb:HSC1 C6051 normalized infant brain cDN 


1 


1 


upregulate stage 


415364 


F06771 




gb:HSClKD031 normalized infant brain cDN 


1 


1 


upregulate stage 


415371 


R15239 




gb:yfB9b02.r1 Scares infant brain 1MB H 


5.1 


0.13 


upregulate stage 


415412 


F0S049 


Hs.52132 


ESTs 


4.25 


0.16 


upregulate stage 


415451 


H19415 


Hs.268720 


ESTs, Moderately similar to ALU INHUMAN A 


4.1 


0.15 


upregulate stagB 


415462 


R52692 


Hs.1 2698 


ESTs 


4.65 


0.11 


upregulate stage 


415496 


R37637 


Hs.1 2286 


ESTs 


5.4 


0.13 


upregulate stage 
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upregulate stage 


419962 


AA830111 


1 1 Of\4 All 

Hs.291917 


ESTs 


1 


1 


upregutate stage 


419970 


AW612022 


Hs.263271 


ESTs 


9.15 


0,09 


upregulate stage 


419986 


A1345455 


u» Ton 4 C 

Hs.78915 


GA-oirtoing protein transcnption factor, 


3.05 


0.1/ 


upregulate stage 


419998 


AA252691 




gb:zs26d09.n NCI_CGAP_GCB1 Homo sapiens 


1 


0.47 


upregulate stage 


420016 


AW016908 


Hs.88025 


ESTs 


1 


0.8 


upregulate stage 


420047 


AI478658 


Hs. 94631 


brefeldin A-inhibited guanine nucleotide 


4.8 


0,11 


upregulate stage 


420076 


AA827860 


Hs.293717 


ESTs 


5.35 


0.12 


upregulate stage 


420111 


AA255652 




gb:zs21h11/1 NCLCGAP_GCB1 Homo sapiens 


5.3 


0.11 


upregulate stage 


420145 


AA809860 


Hs.256284 


ESTs 


1 


1 


upregulate stage 


420159 


AI572490 


Hs.99785 


Homo sapiens cDNA: FU21245 fis, clone C 


14.8 


0.04 


upregulate stage 


420161 


A16830S9 


Hs.175319 


ESTs 


4.7 


0.11 


upregulate stage 


420184 


AA1 88408 


Hs.95665 


hypothetical protein 


4.35 


0.15 


upregulate stage 


420226 


AA773709 


Hs.1 5281 8 


ubiquitin specific protease 8 


3.1 


0.16 


upregulate stage 


420230 


AL034344 


Hs.298020 


Homo sapiens cDNA FU11796 fis, clone HE 


10.35 


0.06 


upregulate stage 


420236 


AA256763 


Hs.291 111 


ESTs 


4.45 


0.14 


upregulate stage 


420270 


AA257990 




gb:zs35h07.r1 NCI_CGAP_GC81 Homo sapiens 


10.05 


0.08 


upregutate stage 


420297 


A1628272 


Hs.88323 


ESTs 


9.45 


0.09 


upregulate stage 


420344 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


11.7 


0.05 


upregulate stage 


420392 


AI242930 


Hs.97393 


KiAA0328 protein 


1.7 


0.22 


upregulate stage 


420413 


AW971624 


Hs.1 20605 


ESTs 


1 


1 


upregulate stage 


420445 


AA262213 


Hs.193514 


ESTs 


1 


1 


upregulate stage 


420471 


AA262452 


Hs.192268 


ESTs 


3.95 


0.13 


upregulate stage 


420479 


AW1 83695 


Hs.1 86572 


ESTs 


4.95 


0.12 


upregulate stage 


420493 


AI635113 


Hs.270366 


Homo sapiens mRNA; cDNA DKFZp564H0616 (f 


4.4 


0.15 


upregulate stage 


420552 


AK000492 


Hs.98806 


hypothetical protein 


11.55 


0.06 


upregutate stage 


420572 


AL035593 


Hs.99016 


Human DNA sequence from clone 310J6 on c 


1.35 


0.22 


upregutate stage 


420643 


W87731 




gb:zh65g10.r1 Soares_fetal_llv9r_spIeen_ 


1.25 


0.25 


upregulate stage 


420650 


AA455706 


Hs.44581 


heat shock protein hsp70-related protein 


7.3 


0.09 


upregulate stage 


420654 


AA279091 


Hs.1 04420 


ESTs 


1 


0.27 


upregutate stage 


420655 


R74405 


Hs.300886 


ESTs 


1 


1 


upregulate stage 


420717 


AA284447 


Hs.271887 


ESTs 


9 


0.09 


upregulate stage 


420734 


AW972872 


Hs.293738 


ESTs 


5.2 


0.13 


upregulate stage 


420756 


AA411800 


Hs.1 89900 


ESTs 


1 


1 


upregulate stage 


420789 


AI670057 


Hs.1 99882 


ESTs 


8.85 


0.06 


upregulate stage 


420802 


U22376 


Hs.1 334 


v-myb avian myeloblastosis viral oncogen 


4.6 


0.12 


upregulate stage 


420851 


AA281062 


Hs.250734 


ESTs 


8.35 


0.08 


upregulate stage 


420860 


AI609621 


Hs. 105620 


ESTs 


1 


1 


upregulate stage 


420923 


AF097021 


Hs.273321 


differentially expressed in hematopoiefi 


10.4 


0.03 


upregutate stage 


420928 


AA281809 




gbiztlOeOUl NCJ_CGAP_GCB1 Homo sapiens 


1 


1 


upregulate stage 


420936 


AA456112 


Hs.99410 


EST6 


8.71 


0,07 


upregulate stage 


420947 


AA491044 


Hs.47196 


ESTs 


1 


0.38 


upregulate stage 


421017 


AW979181 


Hs.293221 


ESTs, Weakly similar to AlU1_HUMAN ALU S 


1 


1 


upregulate stage 


421064 


AI245432 


Hs.101382 


tumor necrosis factor, alpha-induced pro 


1.26 


0.62 


upregulate stage 


421070 


AA283185 


Hs.1 9327 


ESTs 


22 


0,14 


upregutate stage 


4z 1 1 UU 


AW JO lOO J 




linmn carrion* rDNA' PI 191 7R3 fie Hnrw> P 
norno Sapiens cuctn. ru< i iqo ns, aone v* 




0,28 




421102 


AM70093 


Hs.89217 


ESTs 


2.65 


0*19 


upregulate stage 


421103 


AI625835 


Hs.27104 


ESTs 


6 


0.1 


upregulate stage 


421114 


AW975051 


Hs.293156 


ESTs 


4.7 


0.12 


upregulate stage 


421118 


A1471925 


Hs.89257 


ESTs 


1 


0.39 


upregulate stage 


421155 


K87879 


Hs.102267 


rysyl oxidase 


1.15 


0.18 


upregulate stage 


421159 


AW978316 


Hs.136649 


ESTs 


1 


0.44 


upregulate stage 


421187 


NM_014721 


Hs.102471 


KIAA0680 gene product 


5.7 


0.11 


upregutate stage 


421218 


NM.000499 


Hs.72912 


cytochrome P450, subfamily I {aromatic c 


ao7 


2.55 


upregulate stage 


421221 


AW276914 


Hs.300877 


ESTs 


a75 


0.07 


upregulate stage 


421229 


A1056590 


Hs.7086 


Homo sapiens cONA: FU23000 fis, done L 


1.64 


0.49 


upregutate stage 


421261 


AA600853 


Hs.98133 


ESTs 


10.9 


0.07 


upregulate stage 


421262 


AA286746 


Hs.9343 


Homo sapiens cDNA FU14265 fis, clone PL 


Z65 


0.16 


upregulate stage 


421278 


AB67919 


Hs.99691 


ESTs 


1 


0.56 


upregulate stage 
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491?flA 


AAAHRnZ 
AAO 1 loVH 




nh'AhAQant cl hid CGAP GCB1 Homo saofens 


421282 


AA2B5914 


Hs. 183299 


ESTs 


421306 


AA8US2Q7 


HS.125BB9 


ESTs 


421308 


AA687322 


Hs. 192843 


ESTs 


421373 


AA808229 


Hs.1 67771 


ESTs 


421079 


Y 15221 


ns.1u.jya2 


small inouciuie cyiuwiw suuidnuij d 


421381 


AA381752 




gb:cbUlJl4 l -Cell lyiuprtOma nomo sapiens 


421418 


AAbUooOs 




QD.ODOOQUu.S 1 Nw.ounr.owl nonio odpjens 


421433 


Al 829 192 


HS.1 <345Qo 


ESTs 


421451 


AA291377 


HS.bOojl 


eqTS 


421491 


H99999 


u» Avne 
HS.4Z735 


bo IS 


421493 


BE300341 


n. 4/vAAnc 

KS.104925 


ectooermawi sural corcox [win a i b-uxb 


421559 


NM_014720 


Hs.1 05751 


Sta20-relaled serine/threonine kinase 


421577 


BE465451 


Hs.1 05925 


single-minded (Drosopnila) nomolog l 


421673 


H54384 


Hs.36892 


ESTs 


421685 


AF1 89723 


Hs.106778 


ATPase, Ca++ transporting, type 20, memb 


421708 


AW754341 




gD:CMO-CT0341 -181 2s^l oO-riu oiUo4i norno 


421733 


All 19671 


Hs.1 420 


fibroblast growui factor receptor 3 (acn 


421838 


AW881089 


Hs.108806 


Homo sapiens mRNA; cDNA DKFZpoooMUwr (t 


421869 


A8003592 


1 I— 4 AAA rrt 

Hs.1 09050 


contactin 6 


421925 


S80310 


il 4 aaaaa 

Hs.1 09620 


acidic epididyma! glyooproteirvlike 1 


421948 


L42583 


Ks.11 1758 


keratin 6A 


421958 


A A *5C"J4 AC 

AA357185 


Hs.lOaslo 


ras homolog gene family, member H 


421991 


NM.014918 


ii 4 4 A Ann 

Hs.1 10488 


KIAA099D protein 


422026 


U80736 


Hs.1 10826 


trinucleotide repeat containing 9 


422072 


AB018255 


U_ 4 4 4 4 nO 

Hs. 111138 


1/1 A AHT41 n.A^tlAt 

WAAU/ 12 gene proauw 


422094 


AF129535 


1 1— ATAAA"» 

H5.272027 


F-box only protein 5 


422158 


L10343 


Hs.112341 


protease inhibitor 3, skin-derived (5KAL 


422168 


AA586894 


1 1„ j 4 una 

Hs.1 12408 


eiWlnolniiim Unit.. nfrtfoin A 7 /nrftrloe 

biuo cataum-Dinoing protein A/ (psonas 


422182 


AL043892 


Hs. 180582 


Homo sapiens cDNA: FLJ21836 fis, clone H 


422204 


AA339015 




gb:EST44247 Peia) wain I Homo sapiens c 


422261 


AA307595 


Hs.1 19908 


nucteotar protein NOP5/NOP58 


422271 


AB038995 


11 — 4 4 4 4 r A 

Hs.114159 


RAB-85 protein 


422278 


AF072873 


Hs.1 14218 


frizzled (Drosopnila) nomolog 6 


422282 


AF019225 


Hs.1 14309 


apolipoprotein L 


422322 


AB022192 


Hs.1 15240 


peroxisome biogenesis factor 13 


422330 


D30783 


I J — 44tAPA 

Hs.1 15263 


eplregulin 


422342 


AA309272 




gb:EST1 80209 Liver, nepatoceJlular carci 


422406 


AF025441 


4 4 MAC 

Hs. 11 6206 


Opa-interacting protein 5 


422487 


A tA 4 f\r\f%4 

AJ0 10901 


LI- 4flQ1£^ 

Hs. 198267 


mucin 4, tracheobronchial 


422491 


AA338548 


11—44 Tr 4A 

Hs.1 17546 


neuronatin 


422504 


AA31 1407 




gbicSTi 82167 JmKSi 1 -ceils V Homo sapte 


422505 


AL1 20862 


Hs.1 241 65 


ESTs 


422508 


AJ000327 


Hs.1 17852 


ATP-binding cassette, sub-family D (ALD) 


422530 


AW972300 


Hs.1 181 10 


bone marrow stromal cell antigen 2 


422540 


A1050751 


Hs.22895 


LUaiAiiiiiJaiui »nM A • CI 1 lIC AO. fir- rAnnn I 

Homo sapiens cuna: rU 23548 ns, done l 


422588 


AA312730 




gb.ESTI 83651 Monocytes, stimulated il Ho 


422678 


AA247778 


Hs.119155 


Homo sapiens mRNA; cDNA DKFZp434B249 (fr 


422762 


AL031320 


Hs.1 19976 


Human ONA sequence from done RP1-20N2 o 


422809 


AK001379 


Hs.1 21 028 


nypotneucal protein FU 10549 


422823 


D89974 


Hs.1 21 102 


vanln2 


422892 


AA988176 


Hs.1 21553 


hypothetical protein FU2064 1 


422938 


NMJJ01B09 


Hs.1 594 


centromere protein A (17kD) 


422964 


AW439476 


Hs.256895 


ESTs 


422981 


AF026445 


Hs. 122752 


TATA box binding protein (TBP)-associate 


423001 


AA320014 


Hs.208603 


ESTs 


423090 


BE387529 


Hs.123536 


melanoma antigen, family E, 1, cancer/te 


423100 


AA323114 




gb:EST25873 Cerebellum II Homo sapiens c 


423121 


AW864848 




gb:P^-SN0018-290oOO-003-C09 5N0018 Homo 


423156 


AA131493 


Hs.1 24752 


tibrobiast growtn lacior uu 


423198 


M81933 


Hs.1634 


ceil division cycle 25A 


423217 


NMJI00094 


Hs.1640 


collagen, type VII, alpha 1 (epidermolys 


423262 


NMJJ05479 


Hs.1 26057 


frequently rearranged in advanced T-cell 


42j290 


MW90 M3i} 




Hnmfl canipnc; cDMA' FU22219 fis dons H 

null VJ sajjtciuj i(Uiw i i-Jitt 19 who 1 1 


423309 


BE006775 


Hs.126782 


sushi-repeat protein 


423347 


AI660412 


Hs.234557 


ESTs 


423359 


NM.014170 


Hs.127496 


HSPC135 protein 


423368 


AA364195 




gb:EST75015 Pineal gland tl Homo sapiens 


423389 


A1471609 


Hs.54347 


ESTs 


423430 


AF112481 


Hs.128501 


RA054, S. cerevisiae, homolog of, B 


423441 


R68649 


Hs.278359 


absent in melanoma 1 like 


423453 


AW450737 


Hs.128791 


CG1-09 protein 


423500 


AF020763 


Hs.129705 


clone 1900 unknown protein 


423578 


AW960454 


Hs.222830 


ESTs 


423629 


AW021173 


Hs.18612 


Homo sapiens cDMA: RJ21909 fis, clone H 


423632 


AA328824 




gb£ST32358 Embryo, 12 week I Homo sapie 


423642 


AW452650 


Hs.157148 


Homo sapiens cDNA FU 1 1 883 fis, done HE 
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423544 


AA329048 




gb,EST32875 Embryo, 12 week I Homo sapte 


* 
i 


A A-i 


upregutate stage 


423646 


AKD0Q456 


Hs. 130546 


hypothetical protein FU20449 


IU.4 


U.U7 


upregulstB stage 


423651 


AF192913 


Hs.1 30683 


zinc finger prolan 1 80 (MHZ1 58) 


5.75 


U.1 


upregulate stage 


423654 


A! 674 253 


Hs.35828 


ESTs 


3.15 


U.IO 


upregutate stage 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


29.7 


0.02 


upregulate stage 


423725 


AJ403108 


Hs.1 321 27 


hypolhelical protein LOC57822 


4.1 


0.16 


upregulate stage 


423745 


AI809797 


Hs.43222 


ESTs 


1 


0,5 


upregulate stags 


423748 


All 49048 


Hs.3021 1 


hypothetical protein FU22313 


4.25 


A it 


upregulate stage 


423753 


Y11312 


Hs. 132463 


phospholnositide-3-kinasB, class 2, beta 


4 40 
1.10 


n 74 

0.71 


upregulate stage 


42375B 


AA338153 


Hs.82124 


lamlnin, beta 1 


1 


4 
1 


upregulate stage 


423774 


L39064 


Hs.1702 


interteukin 9 receptor 


1 no 
J.Uo 


n 4c 


upregutate stage 


423818 


AA332439 




gb:EST36554 Embryo, 8 week I Homo sapien 


1 


U.oo 


upregulate stags 


423827 


A1472828 


Hs. 172625 


ESTs 


1 


0.43 


upregulate stage 


423837 


AW937053 




gb:PM3-DT0037-231299-001-g11 DT0037Homo 


1.44 


0.55 


upregulate stage 


423912 


BE091233 




gb:PMWT0726-30O30r>O01-H07 BT0726 Homo 


1 


1 


upregulate stage 


423938 


AL049328 


Hs. 135642 


Homo sapiens mRNA; cDNA 0KFZp564E026 (fr 


1 


1 


upregulate stage 


423942 


AF2097O4 


Hs. 135723 


gtycolipld transfer protein 


11.65 


e\ fie 


upregulate stage 


423944 


T91433 


Hs.1 28291 


phosphodiesterase 10A 
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0.45 


upregulate stage 
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Hs.1 35892 


Homo sapiens mRNA; cDNA OKFZp761l 1 31 1 (f 
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1 


upregulate stage 
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W28203 


Hs. 1361 69 


Homo sapiens clone 25215 mRNA sequence, 


5.35 


0.09 


upregulate stage 
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Hs.1 37548 


C084 antigen (leukocyte antigen) 


8.8 


0.06 


upregulate stage 


424008 
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Hs.1 37555 


putative chemokine receptor; GTP-binding 


3.14 


0.19 


upregulate stage 
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AW368377 


Hs.1 37569 


tumor protein 63 kDa with strong homoJog 


2.6 


0.26 


upregulate stage 


424073 


U03493 


Hs.1 38959 


gap junction protein, alpha 7, 45kD (con 


1.8 


0.22 


upregulate stage 


424075 


AI807320 


Hs.227630 


RE1 -silencing transcription factor 


9.1 


0.06 


upregulate stage 


424087 


N69333 


Hs.21638 


ESTs 
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1 


upregulate stage 


424193 


AK002005 


Hs. 142868 


Homo sapiens cDMA FU 1 1 143 fis, clone PL 
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0.23 


upregulate stage 


424353 


AA339646 




gb:EST44755 Fetal brain l Homo sapiens c 
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1 


upregulate stage 


424364 


AW383226 


Hs.201189 


ESTs, Weakly similar to DRPLA [Ksapiens 


2.18 


0.33 


upregulate stage 


424406 


D54120 


Hs.1 46409 


wingless-type MMTV Integration site fami 


2.05 


0.17 


upregulate stage 


424420 


BE614743 


Hs.1 46688 


prostaglandin E synthase 


1.19 


0.67 


upregulate stage 


424425 


AB031480 


HS.146B24 


SPR1 protein 


1.42 


0,54 


upregulate stage 
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BE002477 


Hs.278714 


chloride intracellular channel 6 
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0.27 


upregulate stage 
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AJ278016 


Hs.55565 


ankyrin repeat domain 3 


2.02 


0.39 


upregutate stage 
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Ai 133482 


Hs.165210 


ESTs 


3.15 


0.14 


upregulate stage 
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Hs.124918 


Homo sapiens cDNA FU13186 fis, clone NT 
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Hs.1 49894 


mitochondrial translation^ initiation f 
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AL110217 
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DKFZP572C1 63 protein 
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0.26 


upregulate stage 


424589 
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gb:RC3CT0254-100500-21 1-c03 CT0254 Homo 
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0.94 


upregulate stage 


424602 


AKO02O55 


Hs.301129 


Homo sapiens clone 23859 mRNA sequence 


2.85 


0.2 


upregulate stage 


424625 


AW904466 


Hs.151310 


PDZ domain protein (Drosophfla inaD4ike 


1 


0.4 


upregulate stage 


424629 


M9C656 


Hs.1 51 393 


glutamale-cysleine iigase, catalytic sub 


1.41 


0.52 


upregulate stage 


424643 


AF241850 


It— 4 E4 A in 

Hs.15142B 


ret finger protein 2 


9.75 


0.07 


upregulate stage 


424649 


BE242035 


Hs.1 51461 


embryonic ectoderm development 


5.85 


0.13 


upregulate stage 


424653 


AW977534 


Hs.1 51469 


calcium/calmodulirwlependent serine prat 


1 


0.56 


upregulate stage 


424670 


W61215 


Hs.1 16651 


epithelial V-iike antigen 1 


1.42 


0.52 


upregulate stage 


424690 


BE538356 


Hs.1 51 777 


■ i i i_^n^^ Lilian f-.u, *ir 

Human translation initiation factor elF- 


4.3 


0.1 


upregulate stage 


424701 


NMJ05923 


Hs.151988 


mitogen-activated protein kinase kinase 


3.85 


0.11 


upregulate stage 


424702 


AF250237 


Hs.1 52009 


G protein-coupled receptor 85 


1 


1 


upregulate stage 


424717 


H03754 


Hs.1 5221 3 


wingless-type MMTV integration site fami 


4.05 


0.09 


upregulate stage 


424735 


U31875 


Hs.1 52677 


Homo sapiens cDNA FU20338 fis, clone HE 


1.04 


0.59 


upregutate stage 


424736 


AF230877 


Hs.152701 


microtubule-uiteracting protein that ass 


1.11 


0.65 


upregulate stage 


424749 


NMJW2451 


Hs. 152817 


methylthloadenosine phiosphorylase 


1 


1 


upregulate stage 


424834 


AKO01432 


Hs. 153408 


Homo sapiens cDNA FU10570 fis, clone NT 


18.5 


0.03 


upregulate stage 


424841 


AI280215 


Hs.96885 


ESTs 


1 


1 


upregulate stage 


424860 


W6Q828 


Hs.1 53529 


Homo sapiens done 24540 mRNA sequence 


1 


1 


upregulate stage 


424878 


H57111 


Hs.221132 


ESTs 


9.45 ' 


0.07 


upregulate stage 


424879 


AA348013 


Hs.159354 


ESTs 


10,7 


0.07 


upregulate stage 


424888 


AA348126 


Hs.24882 


ESTs 


2.8 


0.21 


upregulate stage 


424905 


NM_002497 


Hs.1 53704 


NIMA (never in mitosis gene a)-related k 


7.75 


0.07 


upregulate stage 


424930 


AABB5344 


Hs.95910 


ESTs 


1.45 


0.38 


upregulate 6tage 


424948 


AA346810 


Hs.190503 


ESTs 


3.2 


0.12 


upregulate stage 


424951 


AWa&40o2 




go.c.bi o/biba wiAot resequences, MAbn rtomo 


O.IO 


a no 


UjJfcyUlalD a lays 


424993 


F07625 




gb:HSC2CF021 normalized infant brain cDN 


1 


1 


upregulate stage 


425020 


U09368 


Hs.154205 


zinc finger protein 140 (clone pHZ-39) 


1 


1 


upregulate stage 


425024 


R39235 


Hs.12407 


ESTs 


2.65 


0,13 


upregulate stage 


425057 


AA826434 


Ns.96944 


ESTs 


1 


0.22 


upregulate stage 


425068 


AL048716 


Hs.154387 


K1AA0103 gene product 


9.5 


0.07 


upregutate stage 


425081 


X74794 


Hs.154443 


miruchrornosome maintenance deficient (S. 


1.66 


0.52 


upregulate stage 


425191 


AF052148 


Hs.155085 


Homo sapiens clone 24653 mRNA sequence 


1 


0.32 


upregutate stage 


425216 


U81504 


Hs.155172 


adaptor-related protein complex 3, beta 


7.05 


0.1 


upregulate stage 


425234 


AW152225 


Hs.165909 


ESTs 


19.7 


0.04 


upregulate stage 


425239 


BE567924 


Hs.155244 


pre-mRNA splicing factor similar to S. c 


1 


0.69 


upregulate stage 
upregulate stage 


425289 


AW139342 


Hs.155530 


interferon, gamma-inducibte protein 16 


10.45 


0.05 


425304 


AA463844 


Hs.31339 


fibroblast growth factor 11 


1.57 


0.51 


upregulate stage 


425316 


AA354977 


Hs.191565 


ESTs. Moderately similar to NSD1 protein 


5.05 


0.12 


upregulate stage 
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425322 


U63630 


Hs. 155637 


protein kinase, UNA-acuvaiea, caiajyuc 


14 7 


u.uo 


upreguisio stage 


425362 


AA355936 




gb:EST6441 0 Jurkat T-cells VI Homo sapie 


* 
1 


1 


upregulate stage 


425397 


J04088 


Hs. 155346 


topolsomerase (DMA) II alpha {170kD} 


8.24 


0.09 


upregulate stage 


425403 


AL023753 


Hs. 155406 


Human UNA sequence from ctona 1 1 aonb on 


1 


[).££ 


upregulate stage 


425415 


M13903 


Hs, 157091 


Invotucrin 


1.19 


0.55 


upregulate stage 


425420 


8E536911 


Hs.234545 


ESTs, Weakly similar to AH 551 35 1 novel 




U.lo 


upregulate stage 


425463 


AK000740 


Hs.1 57986 


hypothetical protein FU20733 


9 


0.07 


upregulate stage 


425465 


118964 


Hs.1904 


protein kinase C, Iota 


9.6 


0.07 


upregulate stage 


425467 


R16484 


Hs. 190075 


ESTs 


1 


0.83 


upregulate stage 


425492 


AL021918 


Hs.158174 


zinc finger protein 184 (KmppeWke) 


3^ 


0.15 


upregulate stage 


425607 


U09860 


Hs. 158333 


protease, serine, 7 (enleroxinase) 


1 


1 


upregulate stage 


425608 


M360486 


Hs.92448 


ESTs 


4.7 


0.14 


upregulate stage 


425614 


AI 334 963 


Hs.156256 


ESTs 


2.65 


0.14 


upregulate stage 


425641 


079758 


Hs.14355 


Homo sapiens cONA FU 13207 fis, clone NT 


4.86 


0.1 


upregulate stage 


425660 


AA521184 


Hs.105504 


ESTs 


1 


0.31 


upregulate stage 


425665 


AK001050 


Hs.159066 


hypothetical protein FU101B8 


1.25 


0.19 


upregulate stage 


425672 


AA361483 




gb:EST70790 T-celi lymphoma Homo sapiens 


1 


1 


upregulate stage 


425707 


AF115402 


Hs.11713 


E74-Iike factor 5 (ets domain transcript 


3.96 


0.13 


upregulate stage 


425726 


AF0B5808 


Hs.159330 


uraplakin 3 


0.92 


0.79 


upregulate stage 


425742 


AJ001454 


Hs. 159425 


lestican 3 


1 


1 


upregulate stage 


425785 


T27017 


Hs.159528 


Homo seplens clone 24400 mRNA sequence 


1 


0.39 


upregulate stage 


425811 


AL039104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1 , Impor 


1.89 


0.44 


upregulate stage 


425843 


BE313280 


Hs.159627 


death associated protein 3 


3.1 


0.15 


upregulate stage 


425852 


AK0O15O4 


Hs.159651 


death receptor 6 


1.72 


0.47 


upregulate stage 


425883 


AL1 37708 


Hs.161031 


Homo sapiens mRNA: cDNA DKFZp434K0322 (f 


0.95 


0.68 


upregulate stage 


426010 


AA136563 


Hs.1975 


Homo sapiens cDNA: FU21007 Ms, done C 


1 


0.34 


upregulate stage 


426028 


NMJWWO 


Hs.172028 


a disintegrin and metalloproteinase doma 


14.3 


0.04 


upregulate stage 


426101 


AL0499B7 


Hs.156361 


Homo sapiens mRNA; cONA DKF2p564F1 1 2 (fr 


11.75 


0.05 


upregulate stage 


426108 


AA622037 


Hs.1 66468 


programmed cell death 5 


3.23 


0.18 


upregulate stage 


426115 


H08B95 


Hs.1 66733 


leucylfcystinyl aminopeptldase 


1 


0.32 


upregulate stage 


426168 


NMJM33152 


Hs.1 67503 


signal transducer and activator of trans 


1.97 


•0.4 


upregulate stage 


426257 


AL1 37201 


Hs.1 68625 


KIAA0979 protein 


1 


0.29 


upregulate stage 


426261 


AW242243 


Hs.16B670 


peroxisomal famesylated protein 


2.8 


0.16 


upregulate stage 


426283 


NM»003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


14.75 


0.04 


upregulate stage 


426451 


AI908165 


Hs.1 69946 


GATA-binding protein 3 


3.05 


0.28 


upregulate stage 


426462 


U59111 


Hs.169993 


dermatan sulphate proteoglycan 3 


1 


0.36 


upregulate stage 


426490 


NMJJ01621 


Hs.1 70087 


aryi hydrocarbon receptor 


14.17 


0.05 


upregulate stage 


426514 


BE616633 


Hs.301122 


bone morphogenetic protein 7 (osteogenic 


1.15 


0.47 


upregulate stage 


426561 


AA381437 




gb:EST9451 4 Activated T-cells i Homo sap 


5.65 


0,11 


upregulate stage 


426711 


AA383471 


Hs.180669 


conserved gene amplified in osteosarcoma 


11.05 


0.05 


upregulate stage 


426731 


AW303411 


Hs.1 30332 


ESTs 


2.4 


0.21 


upregulate stage 


426759 


AI590401 


Hs.21213 


ESTs 


9,5 


0.06 


upregulate stage 


426786 


AA319798 


Hs.172247 


eukaryolic translation elongation factor 


9.25 


0.09 


upregulate stage 


426788 


U66615 


Hs.172280 


SWUSNF related, matrix associated, acB 


5.63 


0.14 


upregulate stage 


426818 


AA554827 


Hs. 124841 


ESTs, Weakly similar to ALU5_HUMAN ALU S 


9 


0.08 


upregulate stage 


426824 


087717 


Hs.172652 


KlAA0013gene product 


1 


0.87 


upregulate stage 


426827 


AW067805 


Hs.1 72665 


methylenetetrahydrofolate dehydrogenase 


8.95 


0.09 


upregulate stage 


426921 


AA037145 


Hs.1 72865 


cleavage stimulation factor, 3' pre-RNA, 


1 


0.19 


upregulate stage 


426935 


NMJJ0QQ88 


Hs.1 72928 


coDagen, type 1, alpha 1 


1.15 


0.72 


upregulate stage 


426997 


BE620738 


Hs.173125 


peptidylprolyl isomerase F (cyclophilin 


11.06 


0.06 


upregulate stage 


427071 


AA397958 


Hs.192719 


ESTs 


5.75 


0.08 


upregulate stage 


427126 


AA620613 


Hs.191827 


ESTs 


2.55 


0.18 


upregulate stage 


427134 


AA398409 


Hs. 173561 


EST 


3.4 


0.18 


upregulate stage 


427142 


AA398510 


Hs.133148 


ESTs 


1 


0.25 


upregulate stage 


427259 


AA400096 




gb:zu69f07.s1 SoaresJeslis_NHT Homo sap 


1 


0.22 


upregulate stage 


427308 


026067 


Hs. 174905 


K1AA0033 protein 


5.9 


0.1 


upregulate stage 


427315 


AA1 79949 


Hs.175563 


Homo sapiens mRNA; cONA DKFZp564N0763 (f 


6.12 


0.11 


upregulate stage 
upregulate stage 


427356 


AW023482 


Hs.97849 


ESTs 


2.7 


0.13 


427370 


AI243615 


Hs.97740 


ESTs 


3.6 


0.14 


upregulate stage 


427376 


AA401533 


Hs.1 9440 


ESTs 


Z1 


0.16 


upregulate stage 


427387 


BE244966 


Hs.177584 


3-oxoacid CoA transferase 


1 


0.39 


upregulate stage 


427470 


AW999924 


Hs. 178357 


Homo sapiens cDNAFU 13657 fis. clone PL 


3.4 


0.16 


upregulate stage 


427519 


AW085233 


Hs.1 80696 


ESTs 


8.23 


0,1 


upregulate stage 


427521 


AWy7330£ 




CO IS 


7 7*1 


n 1 


upreg ulcus bujyt; 


427528 


AU077143 


Hs.179565 


rrunicrtromosorne maintenance deficient (S. 


5.7 


0.15 


upregulate stage 


427556 


AI743515 




gb:wf72b08.x2 Soares_NFl_T_GBC_S1 Homo s 


1 


1 


upregulate stage 


427581 


NMJ)14768 


Hs.179703 


KIAA0129gene product 


11.45 


0.06 


upregulate stage 


427585 


031152 


Hs.179729 


collagen, type X, alpha 1 (Schmid metaph 


3.25 


0.16 


upregulate stage 


427603 


AI09083B 


Hs.98006 


ESTs 


1 


1 


upregulate stage 


427646 


AJ878042 


Hs.271953 


ESTs 


SJ 


an 


upregulate stage 


427652 


A1673025 


Hs.43874 


ESTs 


1 


0.34 


upregulate stage 


427742 


AA411880 


Hs.190888 


ESTs 


2.4 


0.16 


upregulate stage 


427814 


W28383 


Hs.180900 


WiUiams-Beuren syndrome chromosome regl 


9.13 


0.08 


upregulate stage 


427839 


AA608823 


Hs.98244 


ESTs 


1.9 


0.19 


upregulate stage 


427878 


C05766 


Hs.181022 


OGMJ7 protein 


4.1 


0.14 


upregulate stage 


427922 


AK001934 


Hs.181112 


HSPC1 26 protein 


Z6 


0.19 


upregulate stage 


427933 


AW974643 


Hs.190571 


ESTs 


4.55 


0.14 


upregulate stage 
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4£/3j4 


A A Q\f\CA i 


U. 0O40CC 


ESTs 


1 

i 


1 


upregulate stage 


A17QAA 


A A / 4 7Q7D 

AA4170MJ 


HS.4o41ji 


bois, weaKiy similar 10 aluojiuman alu o 


D. 13 


n 1 


upregulate stage 


JI07QO4 


AW2S3165 


Uf> 1404 OX 

HS.14J134 


fcoTS 


4.00 


n 11 

U. 11 


upregulate stage 


427986 


N45214 


HS.2B2387 


i)_ .__)_-,. .nil A , CI I04Q07 fie nUnn U 

Homo sapiens cDNA: ru2 itw/ ns, clone h 


O ce 


n 40 

U.IJ 


upregulate stage 


428003 


ALU 0200 


HS.1B1384 


Homo sapiens mRNA; cDNA DKFZp585B0922 (f 


1.45 


0.36 


upregulate stage 


428004 


AA449563 


Hs.300270 


ESTs 


3.95 


0.12 


upregulate stage 


428010 


AA806554 


Hs.185375 


ESTs 


1 


0.38 


upregulate stage 


428057 


AJ343641 


Hs. 185798 


ESTs 


1U.1 


U.Uo 


upregulate stage 


428058 


AJ821625 


Hs. 191602 


ESTs 


1 


0.5 


upregulate stage 


428071 


ft m4 AO 1 ft 

AF212848 


Hs. 162339 


ets homologous factor 


6.4 


0.09 


upregulate stage 


428182 


BE386042 


1 t AA«WM A mm 

Hs.293317 


ESTs, Weakly similar to JM27 [H.sapiens] 


1 


0.23 


upregulate stage 


428192 


AA424051 




0ozv8OdO3j1 Soares_toiajeao_Nb2HF8_ 


245 


0.16 


upregulate stage 


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


a oc 
9.25 


0.04 


upregulate stage 


428403 


AI393048 


Hs.239894 


leucine rich repeat (in FUI) interactin 


9.94 


0.06 


upregulate stage 


428436 


BE080180 




gb:RC4-BTQ629-1 20200-01 1-b10 BT0529 Homo 


1 


1 


upregulate stage 


428450 


Lift 1 AJ i^Ai 

NM_014791 


Hs. 184339 


l/l A A/14 7C 1- , ,,4 

KIAA01 75 gene product 


A JtO 

4.43 


0.16 


upregulate stage 


428479 


Y00272 


Hs. 184572 


cell division cycle 2, G1 to S and G2 to 


9.2 


0.07 


upregulate stage 


428529 


AW262022 


Hs.106278 


Homo sapiens cDNA FLJ 12839 fis, done NT 


1 


1 


upreguiate stage 


428576 


AW009330 


Hs. 167621 


ESTs 


1 


0.3 


upregulate stage 


428605 


AB037862 


Hs. 186756 


K1AA1441 protein 


9.25 


0.09 


upregulate stage 


428664 


AK001666 


Hs.1 89095 


similar to SALL1 (sal (DrosophOa)-Jike 


3.8 


0.08 


upregulate stage 


428685 


AF131853 


Hs.1 89527 


Homo sapiens done 2501 6 mRNA sequence 


1 


1 


upregulate stage 


428716 


AL122118 


Hs.190614 


Homo sapiens mRNA: cDNA DKFZp43401221 {f 


1 


0.65 


upregulate stage 


428783 


AW070204 


Hs.1 781 76 


ESTs 


1.6 


0.23 


upregulate stage 


428788 


AF082283 


Hs.193516 


B-cell CLL/lymptioma 10 


9.6 


0.08 


upregulate stage 


428829 


R 14050 


Hs. 194051 


ii— _i rivu nil * r\i/r^FAAn^4 4 it- 

Homo sapiens mRNA; cDNA 0KFZp566B213 (fr 


5.45 


0.11 


upregulate stage 


428839 


AI767756 


Hs.82302 


ESTs 


10 


0.06 


upregulate stage 


428881 


AI29B368 


Hs.98918 


ESTs 


1.4 


0.18 


upregulate stage 


428954 


AF1 00781 


Hs.1 94678 


WNT1 Inducible signaling pathway protein 


1 


1 


upregulate stage 


428988 


AA442900 


Hs.27947 


ESTs 


3.05 


0.13 


upreguiate stage 


429042 


AW015489 


Hs.235920 


ESTs 


1 


0.56 


upregulate stage 


429057 


AF1 56557 


Hs.194816 


stomatin-like protein 1 


0.95 


0.93 


upreguiate stage 


429066 


AA868555 


Hs.1 78222 


ESTs 


6 


0.11 


upregulate stage 


429072 


AI37622B 


Hs. 108043 


Friend leukemia virus integration 1 


1 


1 


upreguiate stage 


429083 


Y09397 


Hs.227817 


BCL2-related protein A1 


11.12 


0.03 


upregulate stage 


429091 


AA935658 


Hs.1 87939 


ESTs 


8.9 


0.08 


upregulate stage 


429115 


AA446728 


Hs.289020 


Homo sapiens cDNA FU 14098 (is, clone MA 


4.1 


0.17 


upregulate stage 


429127 


AA749382 


Hs.107233 


ESTs 


1 


0.23 


upregulate stage 


429135 


AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


1 


1 


upregulate stage 


429170 


NMJJ01394 


HsJ2359 


dual specificity phosphatase 4 


8.6 


0.08 


upregulate stage 


429174 


BE559598 


Hs.197803 


KIAA0160 protein 


8.4 


0.06 


upregulate stage 


429236 


AA448407 




gbzw68d1 1.s1 Scares Jestis_NHT Homo sap 


1 


0.36 


upreguiate stage 


429268 


AA205388 


Hs,198481 


RAR-related orphan receptor B 


2.9 


0.16 


upregulate stage 


429300 


AB011108 


Hs.198891 


serinerthreonine-protein kinase PRP4 horn 


4.25 


0.15 


upregulate stage 


429334 


D63078 


Hs.186180 


Homo sapiens cDNA: FU23038 fis, clone L 


2.95 


0.11 


upregulate stage 


429344 


R94038 


Hs.199538 


Inhlbln, betaC 


2.91 


0.28 


upregulate stage 


429359 


W00482 


Hs.2399 


matrix metalioproteinase 14 (membrane-in 


1.19 


0.68 


upregulate stage 


429376 


AI867889 


Hs.43227 


ESTs 


1 


1 


upregulate stage 


429412 


NM.006235 


Hs.2407 


POL) domain, dass 2. assodaling factor 


8.15 


0.07 


upregulate stage 


429450 


AA824451 


Hs.94292 


Homo sapiens cDNA: RJ2331 1 Hs, clone H 


3.3 


0.17 


upregulate stage 


429472 


AW452421 


Hs.1 5652 


ESTs 


1 


1 


upregulate stage 


429482 


AF076974 


Hs.203952 


transformation/transcription domain-asso 


1.52 


0.59 


upregulate stage 


429486 


AF1 55827 


Hs.203963 


hypothetical protein FU 10339 


2.9 


0.15 


upregulate stage 


429572 


AW295375 


Hs.39474 


ESTs 


1 


0.95 


upregulate stage 


429584 


A1817785 


Hs.1 83037 


protein kinase, cAMP-dependent, regulato 


6.55 


0.1 


upregulate stage 


429590 


A!219490 


Hs.44445 


ESTs, Weakly similar to Kelch motif cent 


1 


1 


upregulate stage 


429597 


NM.003816 


Hs.2442 


a disintegrin and metalioproteinase doma 


5.6 


0.14 


upregulate stage 


429601 


AI804293 


Hs.1 19406 


ESTs. Weakly similar to AF1 43946 1 trans 


1.36 


0.58 


upregulate stage 


429602 


AA521463 


Hs.1 83424 


ESTs 


1 


0.34 


upregulate stage 


429617 


X89984 


Hs.211563 


B-cell CLUIymphoma 7A 


10.8 


0.07 


upregulate stage 


429629 


BE501732 


Hs.30622 


Homo sapiens cDNA FU 13010 fis, done NT 


3.4 


0.12 


upregulate stage 


429631 


AA455612 


Hs.1 367 10 


EST 


1 


1 


upreguiate stage 


429644 


AA455892 


Hs.156379 


ESTs 


3.4 


0.15 


upregulate stage 
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malal-rent ilalnrv fran^crintinrv farfnr 1 


4.45 


0.17 




429664 


L20433 


Hs.211588 


POU domain, class 4, transcription facto 


1.17 


0J4 


upregulate stage 


429673 


AA884407 


Hs.211595 


protein tyrosine phosphatase, non-recept 


3.19 


0.22 


upregulate stage 


429699 


AJ383469 


Hs.159300 


ESTs 


4.4 


0.1 


upregulate stage 


429782 


NMJJ05754 


Hs.220689 


Ras-GTPase-activaiing protein SH3-domaln 


4.25 


0.13 


upregulate stage 


429813 


AW139678 


Hs.180791 


ESTs 


1 


0.95 


upreguiate stage 


429828 


AB019494 


Hs.225767 


IDN3 protein 


4.2 


0.14 


upregulate stage 


429838 


AW904907 


Hs.1 08241 


ESTs. Weakly similar to The K1AA01 91 gen 


3.25 


0.14 


upregulate stage 


429859 


NM 007050 


Hs.225952 


protein tyrosine phosphatase, receptor t 


1 


0.36 


upregulate stage 


429913 


AA460608 


Hs.99552 
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429970 AK000072 t&227059 chloride channel, calcium activated, fam 

430042 AB023170 Hs.227850 KIAA0953 protein 

430168 AW968343 Hs.300898 ESTs, Highly similar to AF1281 13 1 promi 

430308 BE5408S5 Hs^38990 cycfirKiependertt kinase inhibitor IB {p2 

5 430399 A1916284 Hs.199871 ESTs 

430763 AA485468 Hs. 105558 ESTs 

431474 AL133990 Hs.190642 ESTs 

431567 N51357 Hs^60855 Homo sapiens mRNA; cONA DKFZp761G231 1 (f 

431585 BE242803 Hs^62823 hypothetical protein FU 10326 

10 431683 AK001749 Hs.267604 hypolheticaJ protein FU 10450 

431709 AF220185 Hs.267923 uncharacterized hypothalamus protein HT0 

431846 BE019924 Hs.271580 uroplaWnIB 

431912 AI660552 Hs.154903 ESTs, WeaWy similar to A561 54 Abl subst 

432350 NIA.0058S5 Hs.274407 protease, serine, 16 (thymus) 

15 432520 AI075978 Hs.1 88007 ESTs 

432524 A1458020 Hs^93287 ESTs 

432540 AIB21517 Hs.105866 ESTs 

432623 AA557351 Hs.1 52448 ESTs, Moderately similar to PUR6.HUMAN M 

432632 AW973801 Hs.134656 ESTs 

20 432820 AI554057 Hs.1 52477 ESTs 

432945 AL043683 Hs.271357 ESTs, Weakly similar to unnamed protein 

433027 AF191018 Hs,279923 putative nucleotide binding protein, est 

433037 NM_01415B Hs.279938 HSPC067 protein 

433156 R59206 Hs.17519 Homo sapiens cDNA: FU22539 lis, clone H 

25 433171 AA579425 gb:nf37c08.s1 NCI_CGAP_Pr2 Homo sapiens 

43331 1 AA688149 gb:nv16h12.s1 NCLCGAP_Pr22 Homo sapiens 

433383 AF034837 Hs.192731 double-stranded RNA specific adenosine d 

433409 AI27B802 Hs.25661 ESTs 

433650 AA603472 Hs.28456 ESTs 

30 433675 AW977653 Hs.1 10771 Homo sapiens cDNA: FLJ21904 lis, done H 

434328 BE564937 Hs.15984 pp21 homolog 

434476 AW858520 Hs^71825 ESTs 

434683 AW298724 Ks.2u2639 ESTs 

434726 AF062719 Hs.139053 ESTs 

35 435124 AA725362 Hs.120456 ESTs 

435563 AF210317 Hs.95497 solute carrier family 2 {facilitated ghi 

435899 W89093 Hs.189914 ESTs 

436026 A1349764 Hs.217081 ESTs 

436154 AA764950 Hs.1 19899 ESTs 

40 436293 AI601188 Hs.120910 ESTs 

436361 AA825814 Hs.149065 ESTs 

436455 A1027959 Hs.132300 ESTs 

436577 W84774 Hs.17643 ESTs 

436684 AW976319 Hs.94806 KIAA1062protBin 

45 437036 A1571514 Hs.133022 ESTs 

437146 AA730977 gb:nw55f05.s1 Na_CGAP_Ew1 Homo sapiens 

437262 BE250537 Hs.174838 Homo sapiens cDNA FU14192 lis, clone NT 

437277 AA748016 Hs.123370 ESTs 

437882 AI243203 Hs.131572 ESTs 

50 438392 AA806395 Hs.123205 ESTs 

438416 K76398 Hs.21167 Homo sapiens cDNA: FU 23068 fis, done L 

438739 AA815391 gb:ai61c02.s1 Soares_testisJiHT Homo sap 

439211 A1890347 Hs.271923 Homo sapiens cDNA: FU 22785 fis, done K 

439394 AA149250 Hs.56105 ESTs. WeaWy similar to WDNM RAT WDNM1 P 

55 439544 W26354 Hs.28891 hypothetical protein FU 11 360 

439569 AW602166 Hs.222399 CEGP\ protein 

439586 AA922938 Hs.110039 ESTs 

439706 AW872S27 Hs.59761 ESTs 

439897 NM_015310 Hs.6763 KIAA0942 protein 

60 439898 AW505514 H&209561 ESTs, WeaWy similar to C05E1 1.1 gene pr 

439949 AW979197 Hs.292073 ESTs 

440035 BE561589 Hs.285122 hypothetical protein FU21 839 

440619 AW408586 Hs.91052 ESTs, Moderately similar to ALU5_HUMAN A 

440635 AW610331 gb:RW^T0316.19010(M)11^08ST0316Homo ' 

65 440787 AW292043 Hs.209433 ESTs 

441233 AA972965 Hs.135568 ESTs 

441528 AI003797 Hs.130815 hypothetical protein FU21 870 

441670 AW874090 Hs.127392 ESTs, Moderately similar to p33lNG1 [H.s 

441683 BE564214 Hs.102946 ESTs 

70 441847 A1215564 Hs.220972 ESTs 

442145 AI022650 Hs.8117 erbb2-tnteracting protein ERB IN 

442299 AW467791 Hs.155561 ESTs 

442315 AA173992 Hs.7956 ESTs 

442528 AF150317 Hs.134217 ESTs 

75 442571 C06338 Hs.165464 ESTs 

442607 AA507576 Hs.288361 Homo sapiens cONA: FU22695 fis, done H 

442652 AI005163 Hs.201378 ESTs, WeaWy similar to KIAA0944 protein 
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Hs.21303 ESTs 

Hs.44343 ESTs 

Hs.6614 ESTs 

Hs.164711 ESTs 

Hs.293798 ESTs 

Hs.215937 ESTs 

Hs.145489 ESTs 

Hs.199713 ESTs 

Hs.145679 ESTs 

Hs.293776 ESTs. Weakly similar to 1 207289A reverse 

Hs.134743 ESTs 

Hs.150757 ESTs 

Hs.10458 small inducible cytokine subfamily A (Cy 

Hs.282990 Human UNA sequence from clone RP1-28H20 

Hs.11085 C6M 11 protein 

Hs.282936 ESTs 

Hs.84520 ESTs 

Hs.143866 ESTs 

Hs.243122 hypothetical protein FU13057 similar to 

Hs.147170 ESTs 

Hs.12045 putauve protein 

Hs.208275 ESTs, Weakly similar to unnamed protein 

Hs.44737 ESTs 

Hs.148559 ESTs 

Hs.154131 ESTs 

Hs.147474 ESTs 

Hs.147482 ESTs 

Hs. 167011 Homo sapiens cDNA:FU21362fis, clone C 

Hs. 12940 zinc-fingers and homeocoxes 1 

Hs.13063 Iranscription factor CA150 

Hs. 13340 hlstoneaceryltransferase 1 

Hs.145582 ESTs 

Hs.26750 Homo sapiens cDNA: FU21908 lis, done H 

Hs.15299 HMBA-inducfole 

Hs.97179 ESTs 

Hs.161321 ESTs 

Hs.263912 ESTs 

Hs.158794 ESTs 

Hs. 18426 Iranslalional inhibitor protein p14.5 

Hs.18792 thioredoxin-Gke, 32kO 

Hs.7222 ESTs 

Hs.19385 CG1-58 protein 

Hs.164285 ESTs, Weakly similar to Afig 1 p [S.cerevis 

Hs.58608 Homo sapiens cDNA FU 14206 fis, clone NT 

Hs.94445 ESTs 

Hs.24908 ESTs 

Hs.13809 ESTs 

Hs.61773 Homo sapiens cDNA FU11648 fis, clone HE 

Hs.293690 ESTs 

Hs.178470 Homo sapiens cDNA: FU22662 fis. done H 

Hs.171442 ESTs 

Hs.7549 ESTs 

Hs.22580 afoylglycerone phosphate synthase 

Hs.57471 ■ ESTs 

Hs.23643 seririe/lhreonine protein kinase MASK 

Hs.197693 ESTs 

Hs.300647 ESTs 

Hs.198899 eukaryotic translation initiation factor 

Hs.301039 Human DNA sequence from clone RP1-132F21 

gb:wd15h01.x1 Soares_NFL_J_GBC_S1 Homo s 

Hs.273193 hypothetical protein FU10706 

Hs.32775 ESTs 

Hs.224623 ESTs 

Hs.288928 Homo sapiens cDNA; RJ23296 fis, done H 

Hs.17230 hypothetical protein FU22087 

Hs.103296 ESTs 

Hs.203330 ESTs 

Hs.60435 ESTs 

gbrtrlOhlZxl NCI_CGAP_Ov23 Homo sapiens 

Hs.25999 hypothetical protein FU22195 

Hs.26090 hypothetical protein FU20272 

Hs.26708 CGM21 protein 

Hs.26764 WAA0546 protein 

Hs.26789 hypothetical protein FU 10320 

Hs.43897 ESTs, WeaWy similar to P2CA.HUMAN PROTE 

Hs.297735 Homo sapiens cDNA: FU22094 fis, clone H 

Hs.172510 ESTs 
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hypothetical protein FU 10775 


3.95 


0.11 


upregulate stage 


453972 


AW1 37224 


Hs.245869 


ESTs 


6 


0.09 


upregulate stage 


454044 


AW022393 




gb:df37h12.y1 Morton Fetal Cochlea Homo 


1.15 


0.18 


upregulate stage 


454289 


AL1 37554 


Hs.49927 


Homo sapiens mRNA; cONA DKFZp434H1720 (f 


7.05 


0.1 


upregulate stage 


454314 


AW364844 




gb:QV3-DT0044-22129WM&c03 DT0044 Homo 


1 


0.37 


upregulate stage 


454315 


AW373564 


Hs.251928 


nuclear pore complex interacting protein 


2.7 


0.13 


upregulate stage 


454775 


BE160229 




gb:QV1-HT0413-090200-062-a12 HT0413 Homo 


8.5 


0.09 


upregulate stage 


454790 


AW820852 




gb:RC2-STQ301*1 20200-01 1-f12 ST0301 Homo 


1.15 


0.14 


upregulate stage 


454792 


AW820794 


Hs.252406 


hypothetical protein FU12296 similar to 


3.65 


0.12 


upregulate stage 


455170 


AW860972 




gb:Q\rtVCT0387-180300-167-ft07 CT0387 Homo 


5.7 


0.07 


upregulate stage 


455511 


BE144762 




gb:CM0-HT01 B(H)41099-065-b04 HT0180 Homo 


1 


0.25 


upregulate stage 


456141 


AI751357 


Hs.288741 


Homo sapiens cDMA: FU22256 lis, clone H 


8.35 


0.09 


upregulate stage 


456258 


AW976410 


Hs.2B9069 


Homo sapiens cONA: FU2101S fis. clone C 


4.85 


0.14 


upregulate stage 


456279 


AW006783 


Hs.6585 


ESTs 


7.25 


0.1 


upregulate stage 


457518 


AA825350 


Hs.143805 


ESTs, Weakly similar to ALU1.HUMAN ALU S 


6.84 


0.11 


upregulate stage 


457570 


AA579426 


Hs.190226 


ESTs 


2.6 


0.2 


upregulate stage 


457982 


AW856093 


Hs.183617 


ESTs 


1 


0.25 


upregulate stage 


458080 


BE142728 




gb:MR0-HT0157-021299-004<i08 HT0157 Homo 


2.05 


0.27 


upregulate stage 


458340 


AI457102 


Hs,121583 


Human glucose transporter pseudogene 


Z25 


0.18 


upregulate stage 


458440 


AI095468 


Hs.135254 


ESTs, Weakly similar to thrambospondin t 


Z35 


0.13 


upregulate stage 


458771 


AW295151 


Hs.163612 


ESTs 


1 


0.19 


upregulate stage 


459092 


AA722012 


Hs.255757 


ESTs, Weakly similar to KIAA061 1 protein 


6.95 


0.1 


upregulate stage 



TABLE IB 



Pkey: Unique Eos probeset identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey 


CAT 


Accessions 


407774 


101538J 


AA084958AA214034AA044587AA130152AA130116D81924 


407939 


1027688J 


W05608AW118352AW198215 


407980 


103087.1 


AA046309 AI263500 AA046397 


408224 


1048369.1 


AW1 75997 AW176000 AW175999 AW175994 AW176004 AW1759B9 


408241 


1048867.1 


AW176546AW178965BE141057 


408268 


104987 1 


AL138247AA053529BE173313 


408277 


1050131.1 


AW1 77959 AW177960 AW178066 AW1 78022 


408306 


1050863 1 


BE141991 AW17B855 BE141990 BE141993 BE141994 BE062405 BE062404 AW178850 


409281 


111502 1 


AA069998 AA075659 AA075660 AA069828 


409408 


1128045 1 


AW387837 AW387874 AW387826 AW387758 AW387864 AW809268 AW387754 


409566 


114012.1 


AA078899 AA078782 AA075788 


409760 


115373.1 


AA302840 T93016 T92950 AA077551 


409794 


1154785.1 


AWB85691 BE185989 BE185940 AW500322 H10282 


409810 


1155339.1 


AW500895AW937549 


409840 


1156071.1 


AW5021 22 AW5021 25 AW501663 AW501 720 


410071 


1174764.1 


AW582568 AW818556 AW818647 AW818655 AW816637 AW818234 


410128 


117773.1 


AW904599 AA325920 AW964919 AA0817B6T05150 


410269 


1189983 1 


AW613597 BE080235 BE080168 


410475 


1204788.1 


AW749927 AW749938 AW749986 AW749966 R87124 AW860547 


410495 


1205826 1 


N95428 W24040 AW751366 H81987 


410520 


1206965.1 


AW752710 BE180336 BE180188 


410534 


1207247.1 


AW905138 AW753008 R13818 Z43519 


410537 


1207336 1 


AVY7531 08 AW852909 N36993 AW894327 AW903629 


41053) 


1208311.1 


N29220 AW754225 AW754224 


410562 


1208415.1 


AW858528 AW754362 H74000 


410724 


1218202.1 


AW799269 AW799385 AW799501 AW799386 AW799270 AW799467 


410751 


121964.1 


AA357918 AA089516 AA357917 


410754 


1219733.1 


T63840AW801569AW801568 


410785 


1221055.1 


AW803341 AW803265 AW803403 AWB03456 AW803402 AW80341 3 AW803268 AW803395 AW803334 AW803355 


410844 


1223878.1 


AW807073 AW807055 AW807087 AW807276 AW807030 AW807363 AW845892 AW807O91 AW807275 AW807284 AW807287 AW845891 






AW807195AW807271 


410910 


1226517.1 


AW810204 AW810555 AWB10196 AW810619 AW810507 


410973 


1228236.1 


AWB12278 AW812286 AW812274 


410997 


1228736.1 


AW812877 AW812878 AW812952 AW812880 


411110 


1232598.1 


H93000AW818022AW818044 


411112 


1232692.1 


AW81 81 58 AW865743 AW865702 AW865698 


411132 


1233580.1 


AW819191 AW819252 AW819183 AW819175 AW8191tt AW819186 AWB1 91 80 BE158470 A 
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411137 
411157 
411159 
411162 
411170 
411173 
411193 
411242 



411245 
411282 



411327 
411336 
411339 
411407 
411418 
411425 
411461 
411474 
411526 
411560 
411568 
411691 
411727 
411741 
411771 
411707 
411788 
411826 
411840 
411860 
411880 
411932 
412134 
412178 
412296 
412303 
412305 
412312 
412327 
412354 
412357 
412359 
412367 

412389 
412547 
412559 
412671 
412744 
412838 
413119 
413141 
413266 
413294 
413342 
413402 



413485 
413508 
413618 
413707 
413743 
413782 
413792 
413804 
414091 
414203 
414220 
414276 



1233775.1 

1234099.1 

1234102.1 

1234161.1 

1234379.1 

123446.1 

1235254.1 

1236393.1 



1236412J 
1237660.1 



1238874.1 

1239216 1 

1239387.1 

1244071.1 

124518.1 

1245503.1 

1246730.1 

1247047.2 

1248748.1 

1249443.1 

1249776.1 

1254166.1 

1255052.1 

1255976J 

1257335.1 

1258789.1 

125879.1 

1260006.1 

1260678 1 

126168.1 

1263110.1 

1266125.1 

1279147.1 

1281471.1 

12B8043J 

1288130.1 

1288141.1 

1288349.1 

1288935.1 

1290342.1 

1290611.1 

129085.1 

1291505.1 

1292588.1 
1305813.1 
1306011.1 
1319959.1 
1324622J 
1331178 1 
1349658.1 
1350477.1 
1356260 1 
1359149 1 
136279.1 
1366932.1 



413445 1370833.1 



1372953.1 

1374273.1 

1380384.1 

1384137.1 

1385884 1 

1389068.1 

1389912.1 

1390710.1 

141621.1 

1425510.2 

1426940 1 

14321 15.-1 



AW819265 AW819268 AW819246 BE152602 AW819249 AW819251 AW819263 AW819194 

AW819455AW819711 AW819553 

AWB19867 AW819956 AW820159 

AW820178 AWB20183 AVM9870 AW819937 AW835^ 

AW819944 AW8201 82 AW820168 AW819959 AW819953 

AW820503 AW820306 AW820429 BE174741 AW820244 BE1 74738 

R81571AW962014AA324277 

AW821484 AW821461 AW821490 AW821525 AW821526 AW821519 

BE 146808 BE146949 AWB33828 BE147002 BE 1 47066 AW833422AWS33475 AW833485 AW833414 AW833406 AW833395 AW833840 
AW833835 AW833421 BE1 46805 AW833465 BE145753BE147004AW833473AW833573 BE147068AW833466AW833680 BE 147063 BE 147058 
BE146807 AW833830 AW833694 AW833596 AW833601 
AW833441 AW833552AW833700AW833610 AW833673AW833675 

AW99501 1 AW880630 AW995662 AW8801 96 AWB50455 AW995379 AW880634 AW938253 AW880580 AW93B263 AW880584 AW850454 
AW880603 AW880026 AW938265 AW880592 AW852203 AW880588 AW880680 AW995392 AW880598 AW880603 AW880590 AW8801 75 
AW846534 AW880098 AW880108AW860453 AW938051 AW995191 AW880532 AW880542AW938197AW994929 AW880635 AW835438 
AW938057AW938054 

AW836922 AW87671 9 AW876688 AW83691 9 AW836997 AW836908 AW83691 2 AW836993 
AW837675 AW837356 AW837581 AW837348 AW837681 
BE 164598 AW837578 AW837582 
R00903R20424AW844169 

BE241870AI752584Z19223AA307961 AA181165 AA093272AW368570 

AW84601 2 AW846007 AW845996 AW845975 

AW847937 AW847672 AW847809 AW847923 AW847808 AW847616 

AW848427 AW848890 AW848159 AW8481 18 AW848634 AW848285 AW848086 AW848485 AW848283 AW848162 

AWB50327 AW850350 AW850348 AW850375 

AW8511B6 AW996967 BE143456 

BE144593 AW851632 AW968149AW968144 W21059 

AW857199 BE066018 BE065982 BE066082 BE065949 BE066019 BE0G6081 BE065954 BE066084 

AW858443 AW858436 AW861923 AW858551 AW861872 AW858474 AW861904 AW858398 

AW859650 BE071806 AW859655 

AW993247AW861464 

AW863568 BE161696 BE161629 BE161824 

AW897793R20054 H23321 

AW947946 AW947986 AW947977 AW865187 AW865246 AW865182 AW865376 AW865392 AW865184 

AW866330 AW866410 AW866332 AW86641 1 AW866327 AW866326 AW866335 AW866334 AW866331 AW866333 AW866328 AW866329 

T89420N87510AA095932 

AW872477 BE088101 T05990 

AW876548 AW876577 AW876528 AW876623 AW87651 9 AW876540 AW876569 AW876534 AW876563 AW876545 AW876574 

AW895560 AW895557 AW895314 AW895537 AW895287 AW89541 1 

AW898526AW898525Z19700 

AW936233AW936272 

AW936336AW936339 

AW936369 AW936377 AW936355 AW93641 0 AW936460 

AW936686 AW936656 AW936608 AW936635 AW936552 AW936679 AW936726 AW936602 AW936733 

AW937355 AW937450 BE 1 62340 BE 1 62397 

AW939148AW939200BE161819 

AW939537 AW939584 W92705 

AW837985 AW837938 AA101955 AW837913 AW837935 

AW945964 AW946020 AW946034 AW946027 AW946041 AW946044 AW946033 AW946024 AW946021 AW946029 AW94601 5 AW94601 6 

AW946039 AW946045 AW946028 AW946036 

AW947655AW984020 

W27161AW961828 

T31474 T31123 245146 AW962040 

AW977734 H62048 

N31101 N46491 AW994084 

D61870 D61B66BE002674 

BE065941 BE066098 BE066070 BE065997 BE066003 
BE166323 BE067045 

BE300352 BE299274 BE075351 BE297444 
BE144034 N72932BE080176 
AA128535 AW973587 AA553822 

T24065 BE092527 BE092528 BE092204 BE092271 BE092516 BE092202 BE092280 BE092065 BE091991 BE092284 BE092514 
BE091995 BE092063 BE092068 BE091994 BE092283 BE092447 

BE141022 BE141513 BE141532 BE141488 BE141481 BE141526 BE141516 BE141471 BE141478 BE141476 BE141475 BE141021 

BE141533 BE141464 BE 141490 BE141472 8E141480 

N52628BE143639BE143706 

BE145364BE145429BE145418 

BE154078 BE154006 BE153891 BE153523 BE153983 

BE158679BE158714 

BE161004 BE162497 BE161135 BE160999 BE162492 
BE546104 BE165710 BE382801 

BE166924 BE166921 BE166925 BE166915 BE166970 BE166968 

T64682 BE168190 BE168256 

T83742AW6728B2AA135378 

BE262170 BE382553 BE261026 BE273627 

BE2S8094BE267860 

BE297862 
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414356 


1438636.1 


BE549143 BE390613 BE277344 


414833 


148)27 1.1 


T071 14 BE543688 


414985 


1511215.1 


C17372 D631 80 O7890B 063262 


415027 


1515978.1 


D3101 0 030991 031168 031166 031465 


415104 


1522649.1 


060076 060259 061037 


415114 


1523035.1 


D60468 060950 061216 N98936 


415178 


1527354.1 


D80503 081739 N95691 


415298 


1533528 J 


F05086F05091 R17158 


415303 


153360.1 


R 1 1 8 1 3 AA 1 64376 T56632 


415352 


1534744.1 


F06565 Z43466 R18417 F06477 F06476 F07098 


415364 


15350C8.1 


F06771 H04895 Z42778 


415371 


1535066.1 


R15239 245189 F06836 


415392 


1535746.1 


Z44067 F07617 R34555 


415569 


1539911.1 


Z43930 F12170 H10881 T65261 


415572 


1539951.1 


F12294 F12213 T66272 T66261 


415600 


1540373.1 


F12664 T74312 N80318Z20039 


415626 


1540758.1 


Z43847F13068T75331 


415635 


1540853.1 


F1316B R212B9 T77626 


415750 


155215.1 


AA167712 AW935024 AW364438 AW364446 


415773 


1554447.1 


R21651 H00542 


415822 


155791.1 


059243 D63202AA169716 


415948 


156264,1 


AA262226 AI984175 AA171807 


416173 


1574973.1 


R52782 R17313 H24192 R19876 


415343 


1589633 J 


H49213 W90725 H49398 W90726 


416395 


1592777J 


R94575T99886 H52989 


416624 


1604694.1 


H69044 T47567 H75691 T50292 


417304 


166558.1 


H15635H16691M195506 


417396 


1674485.1 


T98987T96744 N78306 


417742 


1696282.1 


R64719 Z44680 R12451 


417974 


171237.1 


AA210765T95700 H94407 


418297 


1736343.1 


R91254T97156 R58711 


418454 


175699.1 


AA315308 AA223392 BE538098 BE087173 


418498 


1762961.1 


T78248 T88763R93361 


418546 


176677.1 


AA224B27 T59708 T59843 BE156903 


418573 


176907.1 


AA225188AL157508AL157509AA225189 


418636 


177402.1 


AW749B55 AA225995 AW7502O8 AW750206 


418712 


1784125.1 


Z42183 T31621T97478 


418946 


180808.1 


AJ21 7097 AW886090 W38035 W38792 AA232835 AW936043 


419059 


1816038.1 


T86216 T86117 T83316 


419145 


182217.1 


N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 


419472 


185148.1 


AW978038 AA804204 AA243400 


419477 


185172.1 


AA826279 AA243426 AW97 1614 AW967805 AW971 605 


419716 


187390.1 


AA553770 AA249091 


419805 


188248.1 


AW966945 AA250984 H91733 AW966941 


419807 


188252.1 


R77402 AA262462 AA250988 R05794 


419834 


188386.1 


AA251139 AW967485 AA251204 


419932 


189159.1 


AA281594 A1365043 AA251996 


419998 


189768.1 


AA252691 AW298356AA688063 


420111 


190755.1 


AA255652 AA28091 1 AW967920 AA262684 


420270 


192170.1 


AA257990 AW816460AI416981 AW500873 


420643 


195293.1 


W87731 AA27S004 AA279001 


420928 


197723.1 


AA281809 H89487 N46537 


421280 


200885.1 


AAB1 1804 AA809404 AA286907 AW977624 


421338 


201378.1 


AA287443 AA419385 BE084078 AI478347 


421381 


201903.1 


AA361752 AW963276 AA288017 


421418 


202288.1 


AA806639 AA2910C8 AA836274 AW97B806 


421708 


205732J 


AW754341 AW858420 AW858475 AW861969 AW861925 AW858554 AW858616 AW858450 AW858545 


421813 


207654.1 


BE048255 AA313083 AA298419 


422204 


213258.1 


AA339015 AW95848 AA306188 


422342 


215498.1 


AA309272 AA30931 2 AW961837 


422429 


' 216469.1 


AA310527 AW962295 Z44865 K06641 


422504 


217160.1 


AA311407 AW958321 N23583 R70050 


422588 


21819^.1 


AA31 2730 AW963285 W28250 W27318 


422760 




oc/nacci acico7CP hwrtnon 


422834 


221906.1 


AA318334 AW961457 AA317752 


423100 


224988.1 


AA323114 AA321992 BE161391 BE161392 


423121 


225175.1 


AW864848 AA322213 AA322209 AW961624 


423368 


227560.1 


AA364195AA325029AW962050 


423632 


230430 1 


AA328824 AW961410 AW958096 


423644 


230585.1 


AA329048AW962484T83628 


423818 


232362^1 


AA332439 AA332579 AA331470 AW962539 


423837 


232478.1 


AW937C63 AW937056 AW937062 AW937059 AW937057 AA331 599 AW937061 AW937093 AW937094 AW937058 AW851 327 BE082B72 




AW752885 AW937055 AW937054 


423912 


233241.1 


BE091233 BE080904 BE079414 BE075356 AA332482 


424353 


238398.1 


AA339646 AA339189 AW966286 


424589 


241202.1 


AW854298 AA343691 H50917 H50907 H50938 


424951 


245273.1 


AW964082 AA348838 AA348839 
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424993 
425360 
425382 
425672 
426356 
426475 
426561 
426603 
427259 
427566 
427980 
428042 
428192 
428337 
428436 



429236 
429917 
429979 
430076 
430350 
430561 
430640 
430795 
431319 
431322 
431670 
431828 
432093 
432222 
432724 
433005 
433108 
433129 
433171 
433311 
433371 
433440 
433586 
433782 
434006 
434092 
434138 
434280 
435382 
436023 
436359 
435383 
436740 
436884 
437087 
437146 

437214 
437240 
437251 
437274 
437324 
437471 
437798 
437B54 
437938 



24578L1 
250631.1 
250555J 
254716.1 
265381.1 
2677B5J 
269158.1 
269825.1 
276594.1 
2B0401.1 
285225.1 
286292.1 
288021.1 
289967.1 
291472.1 



301580.1 

31049.1 

31 1517 J 

312952.1 

316401 1 

319932.1 

321094.1 

323756.1 

331474J 

331543.1 

336353.1 

338201.1 

341283.1 

343347.1 

35330.1 

357346.1 

35896 1 

359479J 

360292.1 

362908 1 

364430.1 

36629 -1 

370470.1 

37414J 

37855.1 

379978.1 

380572.1 

382816J 

405265.1 

41356ZJ 

41847.1 

41888J 

426095.1 

428731.1 

432797.1 

43371.1 

434730.1 

435139.1 

435242.1 

435465.1 

43609.1 

43747.1 

443128.1 

44418.1 

44573.2 



438316 
438366 
438390 
438739 
438990 
439183 
439330 



454745.1 
456119.1 
45662.1 
464193L1 
46760.1 
46956.1 
47119.1 



F07625 R35115 AW953115 F06102H13351 AA349497 F11152 AA707254 

BE547704AA355909 

AA3S5936 AI741379 

AA361483AI557239 

BE536836 M376153 

AL134728 AA379975 AA379440 AW963931 
AA381437 M62B833 AW407275 
AA382291 AA994657 
AA400096AA400167 
A1743515 AA405617 AW276706 
AA418305AI264351 
AA419529H97089 H96977 
AA424051 AW104616 
AA6445G8 AA479489 AA426174 

BE080180 AW827313 AW231970 AA995028 AA428584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 
AA528743 AA552874 AA564758 AW063245 AI267534 AW070190 AW893483 AA770330 AA906928 AA905582 AA75B746 AA551717 AW06331 1 
AA429538 

AA448407 AA448404 AA448424 AA448426 
H80572AU09691 H80520 
AA463338 M86137 MB5945 AI040226 
AA4651 15 AW967750 AW869525 
BE169639AA476976 

BE065227 BE065366 BE065182 BE065184 AA481239 
M482636 R16336 R 16343 R60706 R53647 
AW971398 AA507162 AA486494 
AA873350 T82429T82428 

AW970622 AA503009 AA502998 AA502989 AA502805 T921B8 

AW971287 AA524976 AA513479 

AA572994 AA516249 AA702595 

H28383 AW972670 H28359 AA52580B 

AI204995 AW827539 AW969906 AW440776 AA528756 

X98266N41124 

AW939074 AW939073 BE1 60476 AW939938 AW939206 AW94001 2 AW939076 AA573577 AW750479 AA574383 AW970057 
AB002446 T03146 

AA577814 N68543 N68478 H79613 AI910764 
AA579425 AW969965 AA579102 
AA688149 AW975336 R82513 R79237 
T25451 AA585296 AA585305 
AF052127 

T85301 AW517087 AA601054 BE073959 
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Reft Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. Dunham L et aL" refers to Ihe pubflcaton enfilted The 

DMA sequence of human chromosome 22.' Dunham I. et aL, Nature (1999) 402:489495. 
Strand: Indicates DMA strand from which axons were predicted. 
Nt oosfflon: Indicates nucleotide positions of orpdlrieri exons. 



Pkey 


Ref 


Strand 


Nlposilion 


400471 


9931670 


Minus 


105629-105760 


400495 


9714773 


Minus 


61902-62137 


400528 


6981824 


Plus 


472381 472528,4741 70-474277,475328-475542,475878-476000 


400577 


7960401 


Plus 


101535-101881 


400608 


9887666 


Minus 


96756-97558 


400541 


8117693 


Plus 


47864992 


400644 


8117693 


Plus 


27682-27840 


400666 


8118496 


Plus 


1 7982-1 B1 15,20297-20456 


400749 


7331445 


Minus 


9162-9293 


400750 


8119067 


Plus 


198991-199168,199316-199548 


400751 


7331445 


Minus 


35395-35533 


400761 


8131609 


Minus 


114220-115164 


400762 


8131616 


Plus 


7235-7605 


400773 


8131629 


Minus 


44116-44238,48208-48321 


400343 


9188605 


Plus 


5863-5970.7653-7784,8892-9023,9673-9807,10634-1 0789, 1 5254-1 5403,23827-23958 


400844 


9188605 


Plus 


24746-24872,25035-25204 


400845 


918B605 


Plus 


34428-34612 


400846 


9188605 


Plus 


39310-39474 


400860 


9757499 


Minus 


151830-152104,152649-152744 


400880 


9931121 


Plus 


29235-29336,36363-36580 


400887 


9958187 


Plus 


119239-121542 


400888 


9956187 


Minus 


199600-199875 


400937 


7652890 


Minus 


89519-89905 


400977 


8072510 


Plus 


73950-74364 


401002 


8117251 


Minus 


77898-78050 


401024 


8117489 


Plus 


60551-60802 


401045 


8117619 


Plus 


90044-90184,91111-91345 


401048 


7232177 


Plus 


132430-132761 


401049 


7232177 


Plus 


149157-150692 


401086 


9957912 


Plus 


84561-84884 


401093 


8516137 


Minus 


22335-23166 


401101 


8568122 


Plus 


77081-77226 


401192 


9719502 


Minus 


69559-70101 


401197 


9719705 


Plus 


176341-176452 


401203 


9743387 


Minus 


172961-173056,173868-173928 


401205 


9743388 


Plus 


167373-167433,167936-168031 


401256 


9796573 


Minus 


45482-45620 


401262 


9796963 


Plus 


35662-35799 


401271 


9797373 


Minus 


61292-61911 


401276 


6954274 


Minus 


15919-16096 


401279 


9800062 


Minus 


13535-13669 


401342 


9908882 


Pius 


3096-3242 


401365 


9796180 


Minus 


119572-119672 


401395 


7327842 


Minus 


1 1 502-1 1771.46357-46489.58720-58916 


401420 


7452889 


Minus 


141362-141502 


401439 


8246737 


Plus 


92993-94026 


401451 


6534068 


Minus 


119926-121272 


401508 


7534110 


Minus 


110779-110983 


401519 


6649315 


Plus 


157315*157950 


401537 


7960358 


Minus 


186786-187029,1 90607-190779,198218-1 98348 


401561 


8224660 


Minus 


10652-10838,19815-20018 


401566 


8469090 


Minus 


96277-96420,96979-97160 


401604 


7689963 


Minus 


119835-120185 


401619 


8516761 


Mnus 


141309-143576 


401669 


9801805 


Plus 


25414-26310 


401691 


3582311 


Plus 


162333-162715 


401694 


3540172 


Minus 


64056-64168 


401747 


9789672 


Minus 


118596-118816,119119-119244,119609-119761.12^ 






13193Z132451-132575.133580-13401 1 


401759 


9929699 


Plus 


59811-60665 


401780 


7249190 


Minus 


28397-28617,28920-29045.29135-29295.29411-29567,29705-29787.30224-30573 


401666 


801B106 


Plus 


73126-73623 


401905 


6671966 


Plus 


153965-154441.156599-156819 


401994 


4153858 


Minus 


4290443124.43211-43336,44607-44763.45199-45281.46337-46732 
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ESTs 0.003 
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Hs.34371 


ESTs 
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0.534 


hohooh 


H90477 


Hs.41407 


ESTs 


o!oi3 


452934 
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um^uiuii vol uvAjrw iiuiioi voioicww w i 


0.202 




AI 0911 'id 

r\L\Jc 1 1 OH 


Hs.76884 


Inhthitnr nf DMA hinrtinn 1 rinminant rkftfl 


2.318 


«H«JOUi 


Ml / WWW IU 


Hs.7734B 


h^ntrnxvnrnstanlandin dRhvdroonnasp 154N 

ii/Ul W*/ piwoiG^ioiiuwi uoiijui i/ywiiaow i*r^n 


1.622 


At A Qf)7 
HlHwW 


X90725 


He 77597 


nnln /DmennhiaUdcn kinacp 
puju \i_iiuoupiua/-iiAa i\Mtooc 


246.564 


451575 


AA767622 


Hs 78893 


KIAA0244 orolein 


1.000 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrofjen reQtilated 


4.552 


416498 


U33632 


Hs79351 


potassium channel, subfamily K, member 1 


5.128 


458921 


A!682088 


Hs.79375 


hotrcarboxylase synthetase (bio&Hprop 


0.245 


409235 


AA188827 


Hs.7988 


ESTs. Weakly similar to I38022 hypolheli 


7.249 


440371 


BE258550 


Hs.80449 


Homo sapiens, clone 1MAG&3535294, mRNA, 


0.792 


452732 


BE300078 


Hs.80449 


Homo sapiens, done IMAGE:3535294 t mRNA, 


360.782 


417003 


AL038170 


Hs.80756 


betalne-homocysteine methyttransferase 


60.590 


407584 


W25945 


Hs.8173 


hypothetical protein FU 10803 


9.988 


417312 


AW888411 


Hs.81915 


leukemia-associated phosphoprotein p18 ( 


402.705 


417389 


BE260964 


Hs.82045 


midkine (neurite grcwUv promoting factor 


10.806 


428839 


AI767756 


Hs.82302 


Homo sapiens cDNA RJ14814 fis, clone NT 


1.051 


438315 


R56795 


Hs.82419 


ESTs 


0.226 


417900 


BE250127 


Hs.82905 


CDC20 (cell drviston cyde 20, S. cerevi 


26.260 


417933 


X02308 


Hs.82962 


thyrnldyiate synthetase 


221.090 


418067 


AI127958 


Hs.83393 


cystatlnETM 


Z396 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



438086 


AA336519 


Hs.B3623 


41B205 


121715 


Hs.83760 


413385 


M34455 


Hs.840 


418322 


AA284166 


Hs.84113 


413529 


U11874 


Hs.846 


458027 


L49054 


Hs.85195 


418543 


NM.005329 


Hs.85982 


418583 


AA604379 


Hs.86211 


441801 


AW242799 


Hs.86366 


414792 


BE314949 


Hs.87128 


407246 


S70348 


Hs.87149 


433417 


AA587773 


Hs.8859 


445060 


AA830811 


Hs.88808 


453450 


AW797627 


Hs.89474 


419227 


BE537383 


Hs.89739 


401464 


AF039241 


Hs.9028 


443162 


T49951 


Hs,9029 


431024 


AA713666 


Hs.90462 


419559 


Y07828 


Hs.91096 


419741 


NM.007019 


Hs.93002 


443426 


AF098158 


Hs.9329 


424457 


A1249036 


Hs.94292 


410348 


AW182663 


Hs.95469 


439738 


BE246502 


Hs.9598 


421478 


A16B3243 


Hs.97258 


443767 


BE562136 


Hs.9736 


426902 


AI125334 


Hs.97408 


444874 


AI218496 


Hs.97515 


427356 


AW023482 


Hs.97B49 


430000 


AW205931 


Hs.99598 


419485 


AA469023 


Hs.99807 


420783 


AI659838 


Hs.99923 


421934 


AA300625 




431322 


AW970622 




424040 


AA334400 




433108 


AB002446 




458829 


AI557388 




459169 


AI905517 




400300 


X03363 




440012 


AA861072 




412799 


AI267606 




412964 


BE019688 




406992 


S82472 




414969 


C16195 




413158 


BEO68098 




453823 


AL137967 




453846 


AL157586 




407055 


XB9211 




415204 


T27434 




434572 


AF147340 




438990 


AF085890 




439780 


AL109688 




413671 


Z43712 




406974 


M57293 




455797 


BE091833 




455807 


BE141140 




432189 


AA527941 




443309 


AI821874 




437240 


AA747537 




455189 


AW864176 




444163 


AJ126098 




455170 


AW860972 




454789 


BE156314 




433005 


AW939074 




455380 


BE160188 




455650 


BE064655 




436383 


BE065178 




413100 


8E065208 




428436 


BE080180 




455831 


BE144966 




434414 


AI798376 




414221 


AW450979 




409488 


AW402825 




437938 


AI950087 




451385 


AA017655 




449325 


AA001162 




413316 


W91931 





nuclear receptor subfamily 1 , group I, m 1.000 

IroponJn 1. skeletal, rast 0.159 

IraWeamine-pyrrole 2,3 dtaxygenase 0.490 

cyclin-dependent Wnase inhibitor 3 (CDK 1527 

inteneukin 8 receptor, beta 0.077 

myeloid leukemia factor 1 0.008 

hyaturonan synthase 3 1.813 

hypothetical protein 125,769 

ESTs 55.026 

hypothetical protein RJ23309 ai39 

lntegrin, beta 3 (platelet glycoprotein 0.020 

Homo sapiens, Similar to RIKEN cDNA 5830 313.141 

ESTs 1.000 

ADP-ribosyiatkm factor 6 137.718 

cholinergic receptor, nicotinic, beta po 0.006 

histone deacetylase 5 6.646 

DKFZP434G032 protein 14.057 

Homo sapiens, clone IMAGE:4 132043, mRNA, 3.507 

ring finger protein 0.025 

ubiquitln carrier protein E2-C 684.577 

chromosome 20 open reading frame 1 0.363 

hypothetical protein FU2331 1 175.667 

ESTs 0.011 

sema domain, Immunoglobulin domain (Ig), 428.231 

ESTs, Moderately similar to S29539 ribos 0.005 

proteasome (prosome, macropain) 26S subu 1.168 

ESTs 37.467 

BRCAl-lnteracting protein 1 ; BRCA1-assoc 0.067 

ESTs 1.000 

hypothetical protein MGC533B 0.81 2 

ESTs, Weakly similar to unnamed protein 0.342 

lectin, galactoside-binding, soluble, 7 19.785 

gb:EST1 3476 Testis tumor Homo sapiens cD 103.769 

gb:EST382704 MAGE resequences, MAGK Homo 0.258 

gb:EST3851 0 Embryo, 9 week Homo sapiens 0.01 1 

gb:Homo sapiens mRNA from chromosome 5q2 0.023 

gb:PT2.1_6_G03.r turnotf Homo sapiens cON 1 .000 

gb:RC-BT091 -2 1 01 99- 1 05 BT091 Homo sapien 0.773 

HER2 receptor tyrosine kinase (c-erb-b2, 468.462 

gb:ak32e05.s1 SoaresJestis.NHT Homo sap 0.002 

gbaq91h03jc1 Stanley Frontal SB pool 1 0.010 ■ 

gb:bb28g08jc1 N1H_MGC_5 Homo sapiens cDN 0.003 

gb:beta -pol=DNA polymerase beta {exon a 0.005 

gb:C16195 Oontech human aorta pdyAmRN 0.023 

gb:CM1-BT0368-061299460-c09 BT0368 Homo 0.007 

gb:DKFZp761D2315 rl 761 (synonym: hamy2) 0.994 

gb:DKF2£761H021 6_r1 761 (synonym: hamy2) 0.004 

gb:H.sapiens ONA for endogenous retrovir 0.037 

gbJibc2294 Human pancreatic islet Homo s 76.500 

gb:Homo sapiens full length insert cDNA 0.030 

gb:Homo sapiens full length insert cDNA 1.000 

gb:Homo sapiens mRNA full length insert 0.256 

gb:HSC1 JA121 normalized infant brain cON 0.009 

gb:Human parathyroid hormone-related pep 0.004 

gb:IL2-BT0731-260400-076-F04 BT0731 Homo 2.616 

gb:MRO-HT007S02l299-006^07 HTO075 Homo 0.413 

gb:nh30c04.s1 NCLCGAP_Pr3 Homo sapiens 0.015 

gb:nt58f 1 0jc5 NCLCGAP^PrS Homo sapiens 0.007 

gb:nx85c05.s1 NCLCGAP_GCB1 Homo sapiens 0.006 

gb:PMO-SN0014-260400.002-b08 SN0014 Homo 0.069 

gb:qc54g07j(1 Soares_placenta_8to9weeks_ 394.282 

gb:QVO-CT0387-180300-167-h07 CT03B7 Homo 0.757 

gb:QVr>HT0367-150200-1 14-d02 HT0367 Homo 1.000 

gb:QV1-DT006941O200-O57-c12 DT0069 Homo 0.013 

gb:QV1-KT04i;W)1020lH>59-g05 HT0413 Homo 0.249 

gb:RC1 -BT03 1 3-301 299-01 2-c09 BT0313 Homo 1.000 

gb:RC1-BTO314-O2020(H>12-h01 BT0314 Homo 1.000 

gb:RC1.BT0314-31030(M)1W>09 BT0314 Homo -271.372 

gb:RC4-BT0629-1 20200-01 1-b 10 BT0629Homo 0.002 

gb:RC6-HT0187-201099431.c04 HT0187 Homo 0.011 

gb:tr34b07.x1 NCLCGAP_Ov23 Homo sapiens 293.654 

gb:UI-H-Bl3-ala-a-12-0-Ul.s1 NCLCGAP Su 0.807 

gb:UI-HF-BK0-aaq-(WWVUlJl N1H MGC 36 0.965 

gbwq05c02j{1 NCI_CGAPJ<id12 Homo sapien 1952 

gb2e39h01 j\ Scares retina N2MHR Homo 7.341 

gbze48bQ6 jl Scares retina N2MHR Homo 0.004 

gb:zh47cOU1 Scares Jetatjiver_spleen_. 0.004 
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401016 


0.342 


401335 


0.256 


401555 


1.000 


401760 


301.372 


401761 


247J41 


401961 


1.722 


402239 


5.180 


402305 


o!917 


402424 


551.141 


402777 


151231 


402778 


0.008 


402837 


0.367 


402948 


154.103 


402952 


17.038 


403142 


0.196 


403297 


12.744 


403637 


0.304 


403657 


0.032 


404136 


0.008 


404249 


0.065 


404875 


1.105 


404917 


69.590 


404983 


1.000 


405238 


1.000 


405364 


294.141 


405531 


1.747 


405601 


145.551 


405621 


0.224 


405932 


1.968 


406117 


0.333 


406354 


1.000 


406548 


0.002 


406599 


0.010 


459702 AI204995 


0.449 



TABLE 2B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 

409488 

412799 

412964 

413100 

413158 

413316 

413671 

414221 

414969 
415204 
421934 
424040 
428436 

431322 
432189 
433005 
433108 
434414 



434572 
436383 
437240 
437938 



CAT Number 

1134791J 

132817 1 

1339278J 

1349119.1 

1351251J 

1360169.1 

1382504.1 

142696.1 

1510393.1 
1529407 J 
209339J 
234659.1 
291472.1 

331543 1 
342819.1 
357346.1 
35896.1 
38585J 



38911.1 
41888.1 
435139.1 
44573J 



Accession 

AW402825BE544338 
AI267606 AA121045 AA126521 
BE019688BE144460 

8E065208 BE065224 BE065168 BE065313 

BE068098 BE068119 BE068063 BE06B088 BE066120 BE06B1 55 BE068111 
W91931 W94979BE081744 

Z43712 BE156729 BE156538 BE156731 BE156673 BE156539 BE156674 BE156430 BE156672 BE156675 BE156432 BE156541 

AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 BE01 1367 

BE01 1368 BE01 1362 BE01 1215 8E01 1365 BE01 1363 

C16195C16230C16211 C16164C16251 

T27434 Z25288 F00323 D82802 R85077 

AA300625 R16859R16860AW898335W24337 

AA334400 AA334257 AW9661 24 

BE080180 AW827313 AW231970 AA995028 AA428584 AW87271 6 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 AA528743 
AA552874 AA564758 AW063245 AJ267534 AW070190 AWB93483 AA770330 AA906928 AA906582 AA758746 AA551717 AW06331 1 AA429538 
AW970622 AA5030 09 AA502998 AA502989 AA502805 T921 88 
AA527941 AI810608 AI620190AA635266 

AW939074 AW939073 BE160476 AW939938 AW939206 AW94001 2 AW939076 AA573577 AW750479 AA574383 AW970057 
AB002446 T03146 

AI798376 S46400AW811617AW811616W00557BE142245AW858232AW861B51 AW858362 AA232351 AA218567AA055556AW858231 
AW857541 AW814172 H6621 4 AW814398 AF1 34164 AA243093 AA1 73345 AA199942 AA223384 AA227092 AA227080 T12379 AA0921 74 
T61 139 AA149776 AA699829 AW879188 AW813567 AWB13538 AI267168 AA157716 M167719 AA100472 AA100774 AA130756 AA 157705 
AA157730 AA157715 AA053524 AW849581 AW854566 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 
A1829309 AW991957 N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 
AW817705AW817703AW817659BE081531 H59570 
AF1 47340 T51948 T52029 
BE065178AJ227879 
AA747537 BE089068 BE0B9070 

A1950087 N70208 R97040 N36809 A13C81 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 AA251875 
AI820501 A1820532 W87891 T85904 U71456 T82391 BE328571 T75102R34725AA884922BE328517 AI21 9788 AA884444N92578F 13493 
AA927794 A1560251 AW8740S8 AI134043 AW235363 AA663345 AW008282 AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 
AA282915 AW102898 A18721 93 AJ763273 AW173586 AW150329 AI653832 A1762688 AA988777 AA488892 A1355394 AW103813 AI539642 
AA642789AAB56975AW505512AI961530AW629970BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 
AW961101 AA251669 AA251 874 AI81 9225 AW205862AJ683338 AI858509 AW276905 AI633005AA972584 AA908741 AW072629AW513996 
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AA293273 AA969759 N75628 N22388 H84729 HB0052 T92487 A1022058 AA780419 AA551005 W80701 AW613456 AI373032 A! 564269 F00531 






HB3488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 


438990 


46760J 


AF085890 H29949 H29856 


439780 


47673~1 


AL1MGR3 R23665 R26578 


440012 


483290 1 




443309 


56604 1 


AI821 874 A1821 86fl AA630932 AAB53897 AA650103 AI821 131 AI821 1 24 


4441 fil 


593559 \ 


Al 1 26MB A) 1 84746 A11 48521 


449325 


8048Q.T 


AA001 162 AA018950 AA017505 R84446 


451385 


86787J 


AA017656 AA017374 AA019761 


453823 


982526.1 


AL137967 BE064160 BE0641B6 


453846 


983043.1 


AL157588AL157590 


454789 


1234742.1 


BE156314BE156316AW820750 


455170 


1256906.1 


AW850972 AW862598 AW862599 AW860988 AW8509B3 AW860898 AW860925 AW860922 AW860986 AWB60984 AW860989 


455189 


1259271.1 


AW864176AW864133AW864185AW864137 


455380 


1287679.1 


BE160188 AW935785 BE160401 BE160319 BE160313 BE160395 


455550 


1348720.1 


BE064655BE153953 


455797 


1366826.1 


BE091833 BE091874 BE091871 


455807 


1370914.1 


BE141140 BE141 139 BE141 105 BE141143 BE141127 BE141202 BE141108 


455831 


1373969.1 


BE144966BE144957BE144958 


458829 


773443.1 . 


AI557388 BE158936 


459169 


920641.1 


AI905517 AI905455 AI905452 



TABLE 2C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Idenllfier (Gl) numbers. "Dunham I. el al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition : indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


401016 


8117441 


Ptus 


126234-126359,128050-128236 


401335 


9884881 


Plus 


15736-16352 


401555 


8099284 


Minus 


162520-162657 


401760 


9929699 


Pius 


831 26-83250.85320-85540,947 19-95287 


401781 


7249190 


Minus 


83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,8629(^86814 


401961 


4581193 


Minus 


124054-124209 


402239 


7690131 


Pius 


38175-38304,4213342266 


402305 


7328724 


Plus 


40832-41362 


402424 


9796344 


Minus 


64925-65073 


402777 


9588235 


Plus 


126786-126948 


402778 


9588235 


Plus 


128560-128702 


402837 


9369121 


Minus 


2013-2186,9570-9758,11136-11309,19429-19677,2121 0-21455,23368-23562.24342-24527,29132-29320 


402948 


9368458 


Minus 


143456-143626,143808-143935 


402952 


9408724 


Minus 


119452-119619 


403142 


9444521 


ptus 


89286-90131 


403297 


8096824 


Minus 


16584-17264 


403637 


8671936 


Minus 


142647-142771,145531-145762 


403657 


8843996 


Minus 


156223-156370 


404136 


6981900 


Minus 


42S3&46428 


404249 


8655533 


Phis 


6427IW4633 


404875 


9801324 


Plus 


965B8-96732.97722-97831 


404917 


7341851 


Plus 


4933049498 


404983 


4432779 


Minus 


51178-51374,52000-52173 


405238 


7249119 


Minus 


51728-51836 


405354 


2281075 


Minus 


4832548491,49136-49252 


405531 


9665194 


Plus 


35602-35803 


405601 


5815493 


Minus 


147835-147935,149220.149299 


405621 


5523811 


Pius 


59362-59607 


405932 


7767812 


Minus 


123525-123713 


406117 


9142932 


Ptus 


54304-54584 


406354 


9256049 


Minus 


2095-2377 


406548 


7711514 


Minus 


25138-26762 


406599 


8248616 


Phis 


10933-11086 
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Table 3A; Preferred therapeutic targets for bladder cancer 

Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene genB 6Ua 

R1: 90th percentile of bladder tumor A)s divided by the 90th percentile of normal bladder sample Ate 

R2: 90th percentile of bladder tumor Als divided by the SOU) percentile of normal body sample Als 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


421948 


L42583 


Hs.334309 


keratin 6A 


14.20 


1.20 


439926 


AW014875 


Hs.137007 


ESTs 


11.31 


21.34 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


9.15 


45.75 


421110 


AJ250717 


Hs.1355 


cathepsin E 


9.07 


45.35 


417308 


H60720 


Hs.B1892 


KIAA0101 gene product 


aso 


1.99 


431211 


M86849 


Hs.323733 


gap junction protein, beta 2, 26kO (conn 


a39 


1.26 


418406 


X73501 


Hs.84905 


cytokeralin 20 


aio 


40.50 


445619 


AU076643 


Hs.313 


secreted phosphoproteln 1 (osteoponlin, 


7.98 


1.38 


433001 


AF217513 


Hs.279905 


clone HQO310PRO0310p1 


7.67 


2.12 


408243 


Y00787 


Hs.624 


intsrieukin 8 


7.56 


4.85 


417715 


AW969587 


Hs.86366 


ESTs 


7.45 


4.70 


417720 


AA205625 


Hs.208067 


ESTs 


7.34 


9.18 


423673 


BE003054 


Hs.1695 


matrix metailoproteinase 12 (macrophage 


7.30 


26.07 


418007 


M13509 


Hs.83169 


matrix metalloproteinase 1 (interstitial 


7.12 


35.60 


413753 


U17760 


Hs.75517 


{arnfnin, beta 3 (orcein (125kO), kalinin 


6,95 


4.96 


441633 


AW958544 


Hs.112242 


normal mucosa of esophagus specific 1 


6.42 


0.89 


422168 


AA586894 


Hs.1 12408 


S100 calcium-binding protein A7 (psorlas 


6.08 


3.49 


407242 


M18728 




gkHuman nonspecific crossreacting antig 


5.96 


0.96 


405033 






C1002652^i|544327lsp|O04799|FMO5 - RABlT 


5.84 


16.22 


449230 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


5.82 


2.28 


406685 


M18728 




gb:Human nonspecific crossreacting antig 


5.80 


0.89 


420159 


AI572490 


Hs.99785 


Homo sapiens cDNA: FU21245 (is, clone C 


5.77 


28.85 


415511 


AI732617 


Hs.182362 


ESTs 


5.65 


28.25 


426028 


NMJM1110 


Hs.172028 


a cfislntegrin and metailoproteinase doma 


5.60 


6.51 


424008 


R02740 


Hs.137555 


putative chemokine receptor; GTP-binding 


5.59 


2.33 


428651 


AF196478 


Hs.188401 


annaxin A10 


5.55 


27.75 


400843 






NMJH)3105*:Homo sapiens sortJIiivrelated 


5.51 


4.92 


402230 






Target Exon 


5.36 


21.44 


452747 


BE153855 


Hs.61460 


Ig superfamily receptor LNIR 


5.33 


2.80 


416065 


BE267931 


Hs.78996 


proliferating cell nuclear antigen 


5.17 


1.98 


428450 
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1.83 
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1.74 
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1.61 
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AW97n55 
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1.81 


3.67 


431846 


BE019924 


Hs.271580 
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1.80 


4.11 


410153 
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Hs. 15830 
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NM_002497 


Hs. 153704 
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differentially expressed to FanconPs an 


1.76 


1.43 


439606 
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LI- CBCB4 

Hs.58551 


G protein-coupled receptor 87 


1 7fi 
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8.80 


424098 


AF0 77374 


HS. 139322 


small proiine-rich protein 3 


1.76 


0.57 


4J0Q9U 
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HS.2099 


grypican 1 


1.73 


1.39 


452862 


AW378065 


Hs.8687 


ESTs 
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1.1 0 
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1.65 


428484 
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Ue iQAGM 

MS.1O40U1 
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1.72 


1.03 


453883 


A1638516 


Hs.347524 


cofactor required for Sp1 trans cripliona 


1 71 

i«/ 1 


1.66 


448993 


A1471630 


Hs.8127 


KIAAU144 gene product 


1 71 
I./ I 


1.52 


422406 


Ar025441 


Ue none 
HS.1 lbZUb 


Opa-in(eracting protein 5 


1.71 


5.52 


428664 
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Ue 4QQOOC 

Hs. 189095 
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similar to oalli (sa ^urosopniiaH'*« 


1.71 


8.55 


444342 
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HS.IUoO/ 
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1.71 
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1.71 
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1.71 


13.98 


447532 


AKUUUoK 


Ue 4Q7Q4 

HS.Io/91 


nypotneucai protein ruzuou/- 


1.70 


1.84 


414053 




Hs.75725 


transgelin 2 


1.69 


1.51 


447342 
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Hs.19322 


Homo sapiens, oimtiar to ki^n cuna zuiu 


1.09 


8.45 


426050 
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Ue iccnoc 
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1.60 
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1 fifi 
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HS.0l<U2 


CCTe 
CO IS 


1.66 


1.23 


417151 


a a 4nj/trr 

AA1 94055 


Hs. 293858 


ESTs 
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I .DO 


2.08 


452461 


N78223 


Hs.108106 


transcription factor 


1 

I .DO 


8.25 


418462 


BE00159Q 


Hs. 85266 


intsgnn, ceta 4 


1 .DO 


1.78 


417900 


BE250127 


Hs.82906 
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1.0*4 


1 

1.09 


438746 


AI885815 


HS.1B4727 


Human melanoma-associated antigen p97 (m 


1 RA 


1.13 


423161 


ai Ajnim 

AL049227 


Hs.1 24776 


downstream ot caonenn o my 0.0x0; 


1 KO 
I .DO 


1.81 


400900 




Ue C971 1 


Uinh mnMUhf nmnn /nnrrhkfonB chromoso 


1.62 


1.51 


402777 






C10O2652 t :gi|544327]sp|Q04799|FMO5_RABIT 


1.62 


Z33 


436569 


BE439539 


Hs.279837 


gtutathione S-transferase M2 (musde) 


1.62 


Z18 


417515 


L24203 


Hs.82237 


ataxia-telangiectasia group D-assodated 


1.62 


1.27 


413385 


M34455 


Hs.840 


Indoteamine-pyrrote 2.3 dloxygenass 


1.61 


Z05 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


1.60 


1.78 


450635 


AW403954 


Hs.25237 


mesenchymai stem cell protein DSCD75 


1.60 


1.63 


437016 


AU076916 


Hs.5398 


guanine monphosphate synthetase 


1.59 


1.50 


451982 


F13036 


Hs.27373 


Homo sapiens mRNA; cONA DKFZp56401763 (f 


1.58 


1.92 


422247 


U18244 


Hs.1 13602 


solute carrier family 1 (Ngn affinity a 


1.57 


1.79 


408908 


BE296227 


Hs.250822 


serine/threonine kinase 15 


1.56 


7.80 


433159 


AB0358S8 


Hs.150587 


kinesin-Gke protein 2 


1.56 


7.80 


443211 
409893 


AJ128383 


Hs.143655 


ESTs 


1.56 


7.80 


AW247090 


Hs.57101 


rrumdiromosorne maintenance deficient (S. 


1.55 


1.44 
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426900 AW163564 Hs.142375 ESTs 

421066 AU076725 Hs.101408 branched chain aminotransferase 2, mitoc 

413804 T64682 gb^c48b02.r1 Stratagane Uver (937224) 

418641 BE243136 Hs.86947 adslntegrinand metalloproteinase dorna 

5 444783 AK001468 Hs.62180 anilQn (Drosophila Scraps homolog), act 

414035 Y00630 Hs.75716 serine (or cysteine) proteinase inhibito 

418543 NM.005329 Hs.85962 hyaturonan synthase 3 

429211 AF052693 Hs.198249 gapiundionprotein;bela5{connexln3 

402260 NM.001436*:Homo sapiens fibrillarin (FBL 

10 424264 D80400 Ks.239388 Human DNA sequence from clone RP1-304814 

433091 Y12642 Hs.3185 lymphocyte antigen 6 complex, locus 0 

422164 NM.014312 Hs.112377 cortic a) Ihymocyte receptor (X. taevis 

444163 A1126098 gb:qc54g07.x1 Soares_placenta_8to9weeks_ 

452304 AA025386 Hs.61311 ESTs, Weakly similar to S 10590 cysteine 

15 445182 AW189787 ESTs 

408930 AA146721 Hs.334686 hypothetical protein FU21 588 

429359 W00482 Hs.2399 matrix metailoproteinase 14 (membrane-in 

406467 Target Exon 

424244 AV647184 Hs.143601 hypothetical protein hCLA-iso 

20 422094 AF129535 Hs.272027 F-box only protein 5 

431322 AW970622 gb:EST382704 MAGE resequences, MAGK Homo 1.39 

401760 Target Exon 

425247 NM.005940 Hs.155324 matrix metalloprotetnase 1 1 (stromelysin 

424420 BE614743 Hs.146688 prostaglandin E synthase 

25 421817 AF146074 Hs.108660 ATP-Wnding cassette, sub-family C (CFTR 

422119 A1277829 Hs.111B62 KIAA0590 gene product 

418729 AB026449 Hs.67889 helicase-mol 

418399 AF131781 Hs.84753 hypothetical protein FU 12442 

453028 AB006532 Hs.31442 RecQ protein-like 4 

30 431369 BE184455 Hs.251754 secretory leukocyte protease inhibitor ( 

453321 AI984381 Hs.232521 ESTs 

421478 A1683243 Hs.97258 ESTs, Moderately simDar to S29539 ribos 

425726 AF085808 Hs.159330 urop!akin3 

406906 Z25424 gb:H.sapiens proteirvserine/threonlne ki 

35 429413 NH.014058 Hs.201877 OESC1 protein 

418678 NM.Q01327 Hs,87225 cancerAestis antigen (NY-ESO-I) 

431840 AA534908 Hs.2860 POU domain, class 5, Iranscription facto 

417433 BE270265 Hs.82128 5T4 oncofetal trophoblast glycoprotein 

422397 AJ223366 Hs.1 16051 Homo sapiens cDNA: FU22495 lis, clone H 

40 403903 C5001632*:gi|10645308|gb|AAG21430.1|ACOO 

425721 AC002115 Hs.159309 uroplakinIA 

4J3943 AW294416 Hs.144687 Homo sapiens cDNA FU 12381 fis, doner/T 

431515 NM 012152 Hs.258583 endothelial differentiation, lysophospha 

443859 NM.013409 Hs.9914 foliblatin 

45 419743 AW408762 Hs.5957 Homo sapiens clone 24416 mRNA sequence 

422330 D30783 Hs.115263 epiregulin 

420281 A1623693 Hs.323494 Predicted cation efflux pump 

424717 H03754 Hs.152213 wingless-type MMW Integration site fami 

440304 BE159984 Hs.125395 ESTs 

50 422170 AI791949 Hs.112432 anti-Mullerian hormone 

417599 AA204688 Hs,62954 ESTs 

411874 AA096106 Hs.20403 ESTs 

449961 AW265634 Hs.133100 ESTs 

418506 AA084248 Hs.85339 G protein-coupled receptor 39 

55 432239 X81334 Hs.2936 matrix metailoproteinase 13 (coll agenase 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cONA OKFZp566G0321 [( 

423634 AW959908 Hs.1690 heparin-binding growth factor binding pr 

423017 AW178761 Hs.227948 serine (or cysteine) proteinase inhibito 

453355 AA035211 Hs.17404 SOX7 SRY (sex determining region YJ-box 

60 439239 AI031540 Hs.235331 ESTs 

450684 AA872605 Hs.25333 interleukin 1 receptor, type II 

425650 NNL001944 Hs.1925 desmoglein 3 (pemphigus vulgaris antigen 

404403 Target Exon 

406974 M57293 gb:Human parathyroid hormone-related pep 

65 410348 AW182663 Hs.95469 ESTs 

412661 N32860 Hs.24611 ESTs, Weakly similar to 154374 gene NF2 

419121 AA374372 Hs£9626 parathyroid hormone-tike hormone 

426320 W47595 Hs.169300 transforming growth factor, beta 2 

426968 U07616 Hs.173034 amphiphysin (SUfl-Mann syndrome with br 

70 432097 X51730 Hs.2905 progesterone receptor 

452401 NMJJ07115 Hs.29352 tumor necrosis factor, alpha-induced pro 

453389 BE273648 Hs.32963 cadherin 6, type 2, K-cadherin (fetal ki 

419078 M93119 HS.895B4 insulirrama^ssodated 1 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 

75 428182 BE386042 Hs.293317 ESTs, Weakly elmilar to GGC1_HUMAN G ANT 

451844 T61430 gb:yc0Sa03.s1 Stratagene lung (937210) H 

415178 080503 Ks.322850 ESTs 
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highly expressed in cancer, rich in tauc 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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ESTs 
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414521 
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0.84 


0.85 


439569 


AW602166 
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CEGP1 protein 


0.64 


Z42 


432222 


AI204995 




gb:an03c03.x1 Stratagene schlzo brain S1 


0.81 


6.75 


407846 


AA426202 


Hs.40403 


Cbp/p300-intsracu'ng transactivator, wit 


0.60 


0.57 


457292 


AJ921270 


Hs.281462 


hypoUietical protein FU 14251 


0.77 


1.40 


431069 
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ESTs. Weakly simflar to unknown protein 
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gb:an03c03j?1 Stratagene schizo brain S1 
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11.03 


424503 


NM.002205 


Hs.149609 


integrin, alpha 5 (fibronedin receptor, 


0.56 


0.80 


406964 


M21305 




gb:Human alpha satellite and satellite 3 


0.53 


11.16 


416225 


AA577730 


Hs.188684 


ESTs, WeaWy similar to PC4259 ferritin 


0.48 


3.01 


400288 


X06256 


Hs.149609 


Integrin, alpha 5 (fibronectin receptor, 
(tfochrome P450, subfamBy I (aromatic c 
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0.70 
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TABLE 3B 

Pkey: Unique Eos probeset identifier number 
CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

41 1880 1263110J AW872477 BE088101 T05990 

413804 1390710J T64682 BE168190 BE1 68256 

414221 142696 1 AW450979AA1 36653 AA1 36656 AW41 9381 AA984358 AA492073BE1 68945 AA809054 AW238038 BE01 1212 BE01 1359 BE01 1367 

BE011368 BE01 1 362 BE01 1 21 5 BE01 1 365 BE01 1 363 

415989 1 56454 1 AJ267700AI720344AA1 91424 A10235« AJ469633M172fl56AVV958465M1 72236 AW953397AA355086 

431089 327825 1 BE041395 AA491826 M621946AA715980AA666102 

431322 331543 1 AW970622AA503009 AA502998AA502989AA502805 T92188 

432222 343347J A1204995 AW827539 AW969908 AW440776 AA528756 

436608 42361 3 AA628980AI1 26603 BE504035 

439780 47673 1 A1109688 R23665 R26578 

444163 593658J AM 26098 A) 164746 A) 148521 

445182 632151 1 AW189787 AI215430 AW268499 AW205930A1392907 BE093017 BE093019 BE093010 

451644 888230 t T61430 AJ820546 A182I336 

456034 142696 1 AW450979AA1 36653 AA1 36656 AW41 9381 AA984358AM92073BE168W5M8090^AW238038BE011212BE011359 BE011367 
BE011368 BE01 1 362 BE01 1 21 5 BE01 1365 BE01 1 363 



TABLE 3C . 

Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. ■Dunham I. et at.' refers to the publication enfitled The DNA 

sequence of human chromosome 22/ Dunham I. et ai, Nature {1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLpostlion: indicates nucleotide positions of predicted exons. 



Ptey Ret Strand NLposition 

400773 8131629 Minus 44116-44238,48208-48321 
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404875 
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404877 


1519284 


Plus 


1095-2107 


404977 


3738341 


Minus 


43081-43229 


405033 


7107731 


Minus 


142358-142546 


405932 


7767812 


Minus 


123525-123713 


406081 


9123861 


Minus 


38115-38691 


406399 


9256288 


Minus 


63448-63554 


406434 


9256651 


Minus 


17803-17931 


406467 


9795551 


Plus 


182212-182958 
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TABLE 4A: Preferred diagnostics for bladder cancer 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnlgenelD: Unigene number 
Unigene Title: Unigene gene title 



R1: 
R2 

R3 



80th percentile of muscle-invasive bladder tumor {stage T2-T4) Als divided by the 80th percentile of exophytic non-Invasive carcinoma (stage Ta) Ate 
90th percentile of bladder tumor Als minus background divided by 90th percentile of normal body sample Als minus background, where background equals 
the 15th percentile of aD sample Als 



Pkey 


ExAccn 


UnlgenelD 


423961 


D13666 


Hs.136348 


446619 


AU076643 


Hs.313 


444381 


BE387335 


Hs.283713 


408243 


Y00787 


Hs.624 


413063 


AL035737 


Hs.75184 


414020 


NM_002984 


Hs.75703 


424247 


X14008 


Hs.234734 


418007 


M13509 


Hs.83169 


422158 


110343 


Hs.1 12341 


446921 


AB012113 


Hs.16530 


413324 


V00571 


Hs.75294 


412429 


AV650262 


Hs.75765 


406636 


LI 2064 




428330 


L22524 


Hs.2256 


406687 


M31126 




422550 


BE297626 


Hs.296049 


410867 


X63556 


Hs.750 


416658 


U03272 


Hs. 79432 


414812 


X72755 


Hs.77367 


423673 


BE003054 


Hs.1 695 


421379 


Y15221 


Hs.103982 


429500 


X78565 


Hs.289114 


417849 


AW2915B7 


Hs.82733 


400419 


AF084545 




407811 


AW1 90902 


Hs.40098 


400289 


X07820 


. Hs.2258 


414774 


X02419 


Hs.77274 


409420 


Z15008 


Hs.54451 


428227 


AA321649 


Hs.2248 


414476 


AA301857 


Hs.76224 


431639 


AK000680 


Hs.266175 


421958 


AA357185 


Hs.109918 


425247 


NM_OG5940 


Hs.1 55324 


444342 


NM_014398 


Hs.10887 


421493 


BE300341 


Hs.1 04925 


407939 


W05508 


Hs.312679 


429344 


R94038 


Hs.1 99538 


402727 


AL035668 




412420 


Hs.73853 


423217 




rlS.lb4U 


424206 


NM_003734 


Hs.198241 


415138 


C18356 


Hs.295944 


424479 


AF064238 


Hs.149098 


445537 


AJ245671 


Hs.1 2844 


417079 


U65590 


Hs.B1134 


421634 


AA437414 


Hs.106283 


439569 


AW602166 


Hs.222399 


431346 


AA371059 


Hs.251636 


448901 


AK001021 


Hs.22505 


450983 


AA305384 


Hs.25740 


422424 


AI186431 


Hs.296638 


458781 


A1444821 


Hs.63085 


445413 


AA151342 


Hs.12677 


432350 


NfVL005865 


Hs.274407 


403106 






402075 






404860 






434037 


AF116601 




405738 






427585 


D31152 


Hs.179729 


439898 


AW505514 


Hs.209561 


452567 


D87120 


Hs.29882 


401271 






411339 


BE164598 


Hs.274251 



Unigene Title 

penoslin (OSF-2os) 
secreted phosphoprotein 1 (osteopontin, 
ESTs, Weakly similar ta S64054 hypolheti 
interteukin 8 

chitinase 3-like 1 (cartflage grycoprote 
small inducible cytokine A4 (homologous 
rysozyme (renal amyloidosis) 
matrix metalloprotelnase 1 (interstitial 
protease Inhibitor 3, skirwlerived (SKAL 
small inducible cytokine subfamily A (Cy 
corticotropin releasing hormone 
GR02 oncogene 

gb:Homo sapiens (clone WR4.12VL) anlkh 
matrix metailoprotelnase 7 (matrilysin, 
matrix metalloprotelnase 11 (stromelysin 
microfibrillar-assoclated protein 4 
fibrillin 1 (Marfan syndrome) 
fibrillin 2 (congenital contracture ara 
monokine induced by gamma interferon 
matrix metalloprotelnase 12 (macrophage 
small inducible cytokine subfamily B (Cy 
hexabrachion (tenascin C cylotactin) 
nidogen 2 
Target 

cysteine knot superfamSy 1, BMP an tag on 
matrix metalloprotelnase 10 (stromelysin 
plasminogen activator, urokinase 
laminin, gamma 2 (niceln (100kD), kafinl 
small inducible cytokine subfamily B (Cy 
EGF-containing fibuQn-tike extracellula 
phosphoprotein associated with GEMs 
ras homolog gene family, member H 
matrix metalloprotelnase 11 (stromelysin 
similar to lysosome-associated membrane 
ectodermal-neura) cortex (with BTB-fike 
ESTs. Weakly similar to A49019 dynein he 
. inhibin. beta C 

NM_025065:Homo sapiens hypothetical pro! 
bonemorphogenetic protein 2 
collagen, type VII, alpha 1 (epidermolys . 
amine oxidase, copper containing 3 (vase 
tissue factor pathway inhibitor 2 
smootheiin 

EGF-like-domain, multipte 6 

interteukin 1 receptor antagonist 

hypothetical protein FU 10262 

CEGP1 protein 

ubiquitin specific protease 3 

hypothetical protein FU10159 

ER01 (S.cerevisiaeHike 

prostate differentiation factor 

ESTs, Weakly similar to MPP3.HUMAN MAGUK 

CGI- 147 protein 

protease, serine, 16 (thymus) 

C8000064*:gqi0432393[emb|CAC10283.1| (A 

ENSP00O0O251056*:P1asma membrane calcium 

C1003394*:gI|12314272|emb|CAC00591.1l (A 

WW domain-containing oxldoreductase 

CX00039(T:gil6014646|gb|AAF01438.1|AF187 

collagen, type X, alpha 1 (Schmid metaph 

K1AA1715 protein 

predicted osteoblast protein 

C9000559*:gi|12314195|emblCAB99338.1| (A 

hypothetical protein FU20375; KJAA1797 
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R1 


R2 


R3 


11.22 


1.40 


1.44 


8.40 


1.31 


1.38 


7.88 


206 


244 


7.54 


266 


4.85 


6.14 


1.18 


1.18 


5.32 


1.34 


1.36 


5.27 


0.61 


0.57 


5.17 


7.47 


35.60 


4.30 


1.75 


1.54 


4.29 


1.35 


1.60 


4.20 


6.27 


45,75 


4.00 


0.94 


0.93 


3.66 


1.25 


1.25 


3.58 


2.07 


226 


3.41 


4.37 


3.37 


3.09 


0.39 


0.40 


296 


0.44 


0.45 


290 


1.94 


3.46 


280 


1.67 


3.10 


277 


5.62 


26.07 


271 


226 


291 


240 


0.47 


0.37 


234 


0.86 


0.86 


233 


1.54 


212 


210 


1.01 


1.01 


1.99 


3.19 


17.50 


1.94 


249 


1.84 


1.94 


202 


5.39 


1.84 


1.99 


&55 


1.76 


a 33 


0.31 


1.76 


1.13 


1.16 


1.75 


1.33 


1.27 


1.58 


242 


1.39 


1.57 


202 


ass 


1.55 


1.57 


1.55 


1.41 


0.79 


0.67 


1.36 


1.39 


1.34 


1.34 


1.34 


1.56 


1.31 


1.63 


222 


1.27 


200 


1.67 


1.25 


0.16 


0.30 


1.23 


1.70 


4.34 


1.19 


0.27 


0.47 


1.17 


1.59 


293 


1.16 


0.82 


0.80 


1.16 


1.05 


1.05 


1.15 


201 


242 


1.10 


1.64 


1.52 


1.10 


0.31 


0.31 


1.03 


1.51 


1.42 


1.02 


277 


268 


1.00 


1.64 


5.45 


1.00 


1.51 


5.20 


1.00 


1.55 


4.30 


1.00 


1.48 


4.24 


1.00 


1.67 


3.95 


1.00 


1.40 


3.90 


1.00 


1.58 


3.70 


1.00 


1.36 


295 


1.00 


1.45 


1.60 


1.00 


1.28 


1.59 


1.00 


1.10 


1,31 


1.00 


212 


1.00 


1.00 


205 


1.00 
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403005 






tt100002r:g1{1817556|dbj|BAA1367Z1| (D 


1.00 


1.89 


1.00 


431148 


Z83850 




Human DNA sequence from PAC 82J1 1 and co 


1.00 


1.69 


1.00 


434939 


AF161422 


Hs.306567 


Homo sapiens HSPC304 mRNA, partial cds 


1.00 


1.86 


1.00 


431753 


X76029 


Hs.2841 


neuromedin U 


1.00 


1.82 


1.00 


419121 


AA374372 


Hs.89626 


parathyroid hormone-tike hormone 


1.00 


1.69 


1.00 


435505 


AF200492 


Hs.211238 


lnler)eu)drt*1 homolog 1 


1.00 


1.67 


1.00 


452401 


NMJ307115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 


1.00 


1.67 


1.00 


405397 






C1600144r:gi|12053709iemb]CAC20419.1| ( 


1.00 


1.66 


1.00 


404488 






NM_030958*:Homo sapiens organic anion tr 


1.00 


1.66 


1.00 


441206 


BE552314 


Ks.131823 


ESTs. Weakly similar to TERA HUMAN pisa 


1.00 


1.49 


1.00 


407853 


AA336797 


Hs.40499 


dlckkopf (Xenopus laevls) homolog 1 


1.00 


1.47 


1.00 


446119 


D29527 


Hs.290931 


ESTs 


1.00 


1.47 


1.00 


406471 






Target Exon 


1.00 


1.44 


1.00 


402110 






C1 80001 78:gi|1 1 990779|emb|CAC19649. 1 J (A 


1.00 


1.42 


1.00 


407911 


AF104922 


Hs.41565 


growth differentiation factor 8 


1.00 


1.40 


1.00 


404829 






C1002937*:gi|7499208|pjr|iT20993 hypolhe 


1.00 


1.37 


1.00 


421925 


S80310 


Hs. 109620 


acidic epididymal glyroproteln-fike 1 


1.00 


1.26 


1.00 


406076 


AL390179 




Homo sapiens mRNA; cDNA DKFZp547P134 (fr 


1.00 


1.19 


1.00 


456622 


AA972412 


Hs. 13755 


f-box and WD-40 domain protein 2 


1.00 


1.09 


1.00 


416018 


AW138239 


Hs.78977 


proprotein convertase subtiKsto/kexin t 
casein, kappa 


1,00 


1.04 


1.00 


409357 


M73628 


Hs.54415 


1.00 


1.03 


1.00 


436584 


AW976319 


Hs.94806 


ATP-bindlng cassette, sub-family A (ABC1 


1.00 


0.84 


0.84 


436178 


BE152396 


Hs.21590 


hypothetical protein OKFZp564O0523 


1.00 


0.91 


0.80 


402522 






C1000568*:giit2697965|dbjfBAB21801.1| (A 


1.00 


0.60 


0.67 


405735 






ENSP00000252164*:KIAA1578 protein (Fragm 


1.00 


0.66 


0.56 


401905 






ENSP00000252232*:Sterol regulatory eteme 
C500093r:gi|9558454|dbj|BAB03398.1| (AB 


1.00 


0.65 


0.52 


404152 






1.00 


0.58 


0.51 


416693 


AI750878 


Hs.87409 


thrombospondin 1 


1.00 


0.85 


0.51 


451375 


A1792066 


Hs.283902 


Homo sapiens BAC clone RP1 1-48W1 3 from 


1.00 


0.46 


0.38 


430132 


AA204686 


Hs.234149 


hypothetical protein FU 20647 


1.00 


0.84 


0.33 


456983 


AI0816B7 


Hs!l1355 


thymopoietin 


1.00 


0.61 


0.29 


438681 


AW384815 


Hs! 149208 


KIAA1555 protein 


1.00 


0.60 


0.28 


409038 


T97490 


Hs.50002 


smaD inducible cytokine subfamily A (Cy 


1.00 


0.39 


0.19 


409196 


NMJW1874 


Hs.334873 


carboxypeptjdase M 


1.00 


0.43 


0.13 


410023 


AB017169 


Hs.57929 


slit (Drosophila) homolog 3 


1.00 


0.30 


0.12 


420674 


NMJ100055 


Hs.1327 


butyrylcholioesterase 


1.00 


0.30 


0.08 


415165 


AW887604 


Hs.78065 


complement component 7 


1.00 


0.08 


0.06 


425545 


N9B529 


Hs.158295 


Homo sapiens, clone MGC'12401, mRNA, com 


1.00 


0.10 


0.01 


448256 


BE614149 


Hs.20814 


CGl-27 protein 


0.96 


1.32 


1.55 


417389 


BE260954 


Hs.82045 


midkine { neurits growth-promoting factor 


0.95 


3.15 


234 


403214 






NM_01 6232*^0010 sapiens inlerteukin 1 re 


0.94 


1.63 


2.51 


414799 


A1752416 


Hs.77326 


insulin-like growth factor binding prote 


0.92 


1.87 


1.60 


406665 


U22961 


Hs. 164411 


albumin 


0.92 


1.09 


1.03 


401519 






C15000476 # :gi|12737279lreflXP - 0121 63.1| 


0.88 


1.46 


3.44 


417501 


AL041219 


Hs.82222 


sema domain, immunoglobulin domain (ig), 


0.87 


0.41 


0.50 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


0.85 


1.38 


1.43 


405494 






C2001837 , :gll12697903ldbjIBAB2l77ai| {A 


0.83 


1.46 


4.65 


444171 


AB018249 


Hs.10458 


smaD inducible cytokine subfamily A (Cy 


0.60 


0.91 


0.91 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP 1 .HUMAN DEATH 


0.79 


0.56 


0.43 


436395 


AI683487 


Hs. 15221 3 


wingless-type MMTV integration site fami 


0,77 


1.47 


2.37 


426716 


NM 006379 


Hs. 171 921 


sema domain, immunoglobulin domain (tg), 


0.75 


1.13 


1.18 


431347 


AI133461 


Hsl251664 


insulin-like growth factor 2 (somatomedi 


0.68 . 


2.61 


1.87 


413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), katinin 


0.68 


2.70 


4.96 


426322 


J 05068 


Hs.2012 


transcobalamin I (vitamin B12 binding pr 


0.67 


1.50 


1.36 


426514 


BE616633 


Hs.170195 


bone morphogenetlc protein 7 (osteogenic 


0.56 


2.05 


2.46 


422282 


AF019225 


Hs.114309 


apoGpoprotein L 


0.55 


3.91 


3.92 


409757 


NMJJ01898 


Hs.123114 


cystatin SN 


0.53 


2.72 


2.93 


427450 


AB014526 


Hs.178121 


WAA0626gene product 


0.52 


1.34 


1,97 


414555 


N98569 


Hs.76422 


phosphotipase A2, group tlA (platelets, 


0.50 


1.04 


1.05 


423774 


L39064 


Hs.1702 


inlerteukin 9 receptor 


0.49 


2.81 


6.46 


404977 






Insufin-tika growth factor 2 (somatomedi 


0.28 


5.78 


5.17 


428336 


AA503115 


Hs.183752 


microseminoprolein, beta- 


0.21 


1.47 


1.56 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


0.18 


4.05 


3.60 


428651 


AF196478 


Hs.188401 


annexin A10 


0.17 


5.14 


27.75 


421110 


AJ250717 


Hs.1355 


cathepsin E 


0.12 


5.49 


45.35 



TABLE 4B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



pkey CAT Number Accession 

431146 32854 J Z83850 AA45971 7 AW965384 AA333635 
.434037 37918J AF1 16601 Al 110691 AF063566 
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TABLE 4C - 

Pkey: Unique number corresponding to an Eos probesel 

Ret Sequence source. The 7 digit numbers In this column are Genbank Identifier (GI) numbers. "Dunham L et d." refers to the publication entitled "The DNA 

sequence of human chromosome 22.° Dunham I. el at., Nature (1999) 402:469495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


401271 


9797373 


Minus 


61292-61911 


401519 


6649315 


Plus 


157315-157950 


401905 


8671966 


Plus 


153965-154441,156599-156819 


402075 


8117407 


Plus 


121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 


402110 


6131678 


Minus 


173889-174062 


402522 


9798493 


Plus 


20605-20731 


402727 


9211324 


Plus 


54596-54777' 


403005 


5791501 


Minus 


16945-17053,20018-20403 


403106 


7331404 


Plus 


77162-77350,8133841511 


403214 


7630945 


Minus 


76723-77027,79317-79484 


404152 


9884757 


Plus 


41 1 1 1 ^1 28 1 ,45495-457 1 6.47801-4791 0 


404488 


8113286 


Minus 


64835-64994 


404829 


6624702 


Minus 


4913-5093,7310-7469,9472-9621,9951-10082 


404860 


8979555 


Plus 


65852-66081 


404977 


3738341 


Minus 


43081-43229 


405494 


8050952 


Minus 


70284-70518 


405735 


9931101 


Minus 


29854-29976 


405738 


9943998 


Plus 


44370-45410 


406076 


9123123 


Plus 


89972-90319 


406397 


9256243 


Minus 


127317-127454 


406471 


9795566 


Plus 


8738347589 
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TABLE SA: Genes uprequtaled In bladder cancer , 

Pkey: Unique Eos probeset identifier number 

ExAccn. Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Tffle: Unigene gene title 

R1 90th percenfite of bladder tumor Als divided by the 90lh percentile of normal body sample Als 



Pkey 


ExAccn 


UnigenelD 


Unigene iiue 




459702 


AJ 204935 


gD.anu Jcuix i otraiagene scruzo Drain o i 


ii ni 

1 l MO 


4379 1 5 


AI637993 


Hs.202312 


Homo sapiens done m 1 N leraZut leratoca 


in An 


4U4917 






Target Exon 


q ec 


401056 






Ci 100051/ :gi]/iydlUo[gDjAAr4tMyU.l| (At 


q on 
s.uu 


AA"7A1C 


AlooU/a/ 


U, 4 C OQOO 
HS.l0O99£ 


ESTs 


HQ? 


427335 


AA44B542 


Hs.251677 


G antigen 7B 


O.DD 






nS-04o£00 


to IS 


8.35 


401335 






Target Exon 


7 OA 

r.sra 


424264 


Do (MOO 


HS.23S388 


It, — ,. nMA contianna fmm rlnno DD1 TfM D 1 A 

Human una sequence from clone Kri-ou4oi4 


f .00 


409041 


AB033Q25 


U_ COCO 4 

Hs.50081 


Hypotneucai protein, Ar_uoioou jwAAiiy 


7 on 


436608 


AAoZBaBU 




down syndrome critical region protein DS 


4 86 


451950 


AW29Z317 


Hs.213307 


COT* 


A AS 


406542 






C1 9000728 :gi|1 2585552(SplQ9Y2Ul |Z257_HU 


3.73 


437931 


A1249468 


Hs. 124434 


ESTs 


^ 7n 
o./u 


443133 


A lnT3D7Q 

AlUooarO 


Ur> a 1 no 
nS.4lJ/y 


colS 


i fin 


434487 


■rt A 10C7 

AF143867 


Hs.337588 


ESTs, Moderately similar to booto/ aipna 


"i 17 

O.J/ 


402239 






Target Exon 


1 17 
0.0/ 


443162 


T49951 


Hs.9029 


Di^iir434tjUo»i proiein 


91 
J.Zl 


4U3383 






Target Exon 


3 13 


4383 15 


R56795 


HS.82419 


ESTs 


irw 
o.w 


452827 


AI571835 


Hs.55468 


ESTs 


i m 
o.ui 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 reman 


O.Ul 


402948 






NM_ 02 5206. Homo sapiens nypotneiJcaJ prot 




429983 


W92620 


Hs.260855 


ESTs 




429238 


NM_002849 


Hs. 198288 


protein tyrosine phosphatase, receptor t 


0 70 


444371 


BE540274 


Hs.239 


forkhead box Ml 


0 7C 


417003 


AL038170 


Hs.80756 


betame-hornocystBine methyltransferase 


2.70 


414906 


AA1 57911 


Hs.72200 


ESTs 


2.70 


425206 


NMJ)Q2153 


4 1—4 frlAfl 

Hs.155109 


hydroxysterolc (17-oeta) dehydrogenase 2 


£.00 


422283 


AW411307 


Hs.1 14311 


CDC45 (cell division cycle 45, S.cerevls 


2.62 


429345 


R11141 


Hs. 199695 


hypothetical protein 


4t,Ol 


414221 


. AW450979 




gb:UI-H-Bl3-ala-a-12-0-UU1 NCLCGAPJ3U 




.402305 






C1 9000735*:gi|4508027|ref|NP - 003414.1| z 


2.54 


432842 


AW674093 


Hs.334822 


hypothetical protein MGC4485 




427719 


A1393122 


Hs. 134726 


ESTs 


Z.Di 


455797 


BE091833 




go:IL2^TO/37-£t>wo«>fl/^iw d iu/ji Homo 


<f.OU 


414807 


AI738616 


Hs.77348 


hydroxyprostagtandin dehydrogenase 15-(N 


1 AO 


456967 


AW004056 


Hs. 168357 


T-box 2 


1 AQ 
<I.4S 


406387 






Target Exon 


9 AQ 
£.40 


417997 


AA418189 


Hs.23017 


i > ,nM a . n in»T7ii7 c. */ 

Homo sapiens cDNA: FLJ2Z747 ns. clone K 


0 AH 

Z.40 


415752 


BE3 14524 


HS.7B776 


putative transmembrane protein 


9 ifi 


411248 


AA551538 


Hs.334605 


Homo sapiens cDNA RJ 14408 fis, clone HE 


O A1 
£.40 


434293 


NM_004445 


Hs.3796 


EphB6 


2.42 


433078 


AW015188 


Hs.1 21575 


Homo sapiens cDNA FU12231 ns, clone MA 


i An 
4t4U 




HJMJUUUOO 


ns. lowto 




2.38 


418322 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


135 


452012 


AA307703 


Hs.279766 


Wnesln family member 4A 


2.34 


445600 


AF034603 


Hs.12953 


PTPRF Interacting protein, binding prate 


2.33 


418941 


AA452970 


Hs.239527 


E1B-55kDa-asscciated protein 5 


2.33 


454609 


AW810204 




gb:MR4-ST012SO21 19W17^08 ST0125 Homo 


2.28 


444476 


AF020038 


Hs.1 1223 


bocitrate dehydrogenase 1 (NAOP). sofub 


125 


420005 


AW271106 


Hs.133294 


ESTs 


2.22 


439826 


NMJ14965 


Hs.6705 


KIAA1042 protein 


2.22 


405531 






Target Exon 


2.21 


436569 


BE439539 


Hs.279837 


glutathione S-transferase M2 (muscle) 


2.18 


404394 






ENSPCO000241075:TRRAP PROTEIN. 


2.17 


427479 


8E410092 


Hs.178471 


K1AA0798 gene product 


117 


435904 


AF261655 


Hs.8910 


1,2-alpha-mannosidase IC 


113 


431620 


AA126109 


Hs.264981 


Z^-oligoadenyiafe synthetase 2 (69-71 


112 


426682 


AV660038 


Hs7056 


UDP gfycosyl transferase 1 family, polype 


110 


422765 


AW409701 


Hs.1578 


baculovira! 1AP repeat-containing 5 (sur 


110 


451385 


AA017656 




gb2e39h01.r1 Soares retina N2b4HR Homo 


109 


403477 






C30021BO*:gi]7662420|reflNP_055738.1| Ki 


109 


417151 


AA194055 


Ks.293658 


ESTs 


108 


448262 


AW880830 


Hs.186273 


ESTs 


107 


415192 


D17793 


Hs.78183 


aldo-keto reductase family 1, member C3 


104 


402994 






NM_002463*:Homo sapiens myxovirus (irrilu 


104 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



426053 


1168 105 


HS,1 lt\aZ 


pciy(A>- tun ding proieai, cytoplasmic l 


o no 


423271 


W47225 


Hs. 126256 


iniensuwn i, beta 


C\J\ 


419741 


NW_007019 


Hs,93002 


ubtquilin carrier protein E2-C 


1.33 


407581 


R48402 


Hs.1 73508 


P3ECSL 


1 OK 


410197 


NM.005518 


Hs.59889 


34iydroxy4-melhy{g)utaryl-Coen2yma A sy 


1.95 


427122 


AW057736 


Hs.323910 


HER2 receptor tyrosine kinase {c-erb*b2, 


1.93 


436481 


AA379597 


Hs.5199 


HSPC150 prolan similar to uraqumivcon 


1.93 


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat) 


1.89 


401961 






NM_021626:Homo sapiens serine carboxypep 


1.86 


434042 


AI589941 


Hs.8254 


Homo sapiens, Similar b tumor different 


4 OC 

1.85 


447532 


AK000614 


Hs.18791 


hypothetical protein FLJ206O7 


1.84 


418526 


BEO1902O 


Hs.B583B 


solute carrier family 16 (monocarbcxytlc 


4 01 

1.83 


429612 


AF062649 


Hs.252587 


pituitary turnor-transfbrmlng 1 


1.80 


422164 


NM_014312 


Hs.1 12377 


cortic a! thymocyte receptor (X. laevts 


1.80 


422247 


U18244 


Hs.1 13602 


solute carrier family 1 (high affinity a 


1.79 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


1.78 


412115 


AK001763 


Hs.73239 


hypothetical protein FU 10901 


1.77 


414809 


A1434699 


Hs.77356 


transferrin receptor (p90, C071) 


1.75 


432210 


AI567421 


Hs.273330 


Homo sapiens, clone IMAGE3544662, mRNA, 


1.74 


427239 


BE270447 


Hs.1 74070 


ubiquitin carrier protein 


1.74 


459198 


Ai 086347 


Hs.1 51 138 


ESTs 


1.74 


421066 


AU076725 


Hs.1 01408 


branched chain aminotransferase 2, mitoc 


1.71 


424687 


J05070 


Hs.1 51738 


matrix metaHoproteinase 9 (gelatinase B 


1.70 


450663 


H43540 


Hs.25292 


ribonuclease HI, large subunit 


1.70 


417324 


AW265494 




ESTs 


1.67 


453883 


AIS38516 


Hs.347524 


cofactor required for Sp1 transcriptiona 


1.66 


428000 


R35145 


Hs.291904 


accessory proteins BAP31/BAP29 


1.65 


450835 


AW403954 


Hs, 25237 


mesenchymal stem cell protein DSCD75 


1.63 


423397 


NM_00183fi 


Hs.1 652 


chemokine (C-C motif) receptor 7 


1.62 


415440 


D83782 


Hs.78442 


SREBP CLEAVAGE-ACTIVATING PROTEIN 


1.62 


428028 


U52112 


Hs.182018 


interleukin-1 receptor-associated kinase 


1.62 


426783 


Z 19084 


Hs.172210 


MUF1 protein 


1.62 


445937 


AI452943 


Hs.321231 


UDP-Gal:betaGlcNAc beta 1,4- galactosyit 


1.61 


445462 


AA378776 


Hs.288649 


hypothetical protein MGC3077 


1.60 


400965 






C1 10021 90*:gil1 27 37279J refl XP_0 1 2163.1 1 


1.59 


432269 


NW_002447 


Hs.2942 


macrophage stimulating 1 receptor (c-met 


1.59 


429578 


AI969028 


Hs.99389 


ESTs 


1.59 


449027 


AJ271216 


Hs.22880 


dipeptidylpeptidase III 


1.59 


431840 


AA534908 


Hs.2860 


POU domain, class 5, transcription facto 


1.59 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


1.59 


429002 


AW248439 


Hs.2340 


junction ptakogloDln 


1.57 


442410 


AW996503 


Hs.1 97680 


ESTs 


1.56 


407601 


AC002300 


Hs.37129 


sodium channel, nonvol tags-gated 1, beta 


1.55 


418543 


NM-005329 


Hs.85962 


hyaluronan synthase 3 


1.54 


424611 


NM.001421 


Hs.151139 


E74-like factor 4 (ets domain transcript 


1.54 


414732 


AW410976 


Hs.77152 


rriinbhromosome maintenance deficient (S. 


1.54 


408930 


AA146721 


Hs.334586 


• hypothetical protein FU21588 


1.53 


448993 


AW71630 


Hs.8127 


K1AA0144 gene product 


1.52 


414053 


BE391635 


Hs.75725 


transgelin 2 


1.51 


433662 


W07162 


Hs.1 50826 


CATX-8 protein 


1.50 


432562 


BE531048 


. Hs.278422 


DKFZP586G1 122 protein 


1.50 


402260 






NM.001436*:Homo sapiens fibrillarln (FBL 


1.48 


426127 


L36983 


Hs.167013 


dynamin2 


1.48 


427557 


NMJ502659 


Hs.179657 


plasminogen activator, urokinase recepto 


1.48 


418026 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


1.47 


418960 


NM.004494 


Hs. 89525 


hepatoma-derived growth factor (high-mob 


1.46 


428293 


BE250944 


Hs.1 83556 


solute carrier family 1 (neutral amino a 


1.46 


432344 


AI476474 


Hs.248156 


ESTs 


1.46 


453449 


W16752 


Hs.32981 ~ 


sema domain, immunoglobulin domain (Ig), 


1.46 


450690 


AA296696 


Hs.333418 


FXYD domain-containing bn transport reg 


1.46 


441940 


AW298115 


Hs.1 28152 


ESTs 


1.45 


409893 


AW247090 


Hs.57101 


mintehromosome maintenance deficient (S. 


1.44 


439318 


AWB37046 


Hs.6527 


G protein-coupled receptor 56 


1.42 






ns. I \ twuu 


singeo ^urosopxiiia^iixB issa urcnin las 


1 41 

1.11 


428928 


BE409838 


Hs.194657 


cadherin 1, type 1, E-cadherin (epitheti 


1.41 


445417 


AK001058 


Hs.12680 


Homo sapiens cDNA FU 10196 fis, clone HE 


1.39 


441565 


AW953575 


Hs.303125 


p53-induced protein PIGPC1 


1.37 


4391B0 


A1393742 


Hs.199067 


v-erb-b2 avian erythroblaslic leukemia v 


1.35 


418399 


AF131781 


Hs.84753 


hypothetical protein FU12442 


1.32 


432636 


AA340854 


Hs.278562 


claudin7 


1.32 


439053 


BE2445B8 


Hs.6456 


chaperonln containing TCP1 , subunit 2 (b 


1.32 


413762 


AW411479 


Hs.848 


FK5064)inding protein 4 (59kO) 


1.31 


453914 


NMJJ00507 


Hs.574 


fructose-1,&bisphosphatase 1 


1.28 


430056 


X97548 


Hs.228059 


KRAB-associated protein 1 


1.24 


451524 


AK0014S6 


Hs.26516 


hypothetical protein FU10604 


1.23 


431441 


U81961 


Hs.2794 


sodium channel, nonvoltage-gated 1 alpha 


1.20 


439863 


BE547830 


Hs.9408 


paired ImmunogtobuGn-iike receptor beta 


1.19 
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451541 


BE279383 


Hs.26557 


406906 


225424 




429504 


X99133 


Hs.204238 


414002 


NMJ06732 


Hs.75678 


431369 


BE184455 


Hs.251754 


TABLE 56 


I 





pIakophSin3 

gb:H£8piens protein-serin e/lhreonlneki 
RpocaCn 2 (oncogene 24p3) 
FBJ murine osteosarcoma viral oncogene h 
secretory leukocyte protease Inhibitor ( 



1.16 
1.11 
1.00 
0.66 
0.61 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
414221 

417324 
436608 
4513B5 
454609 
455797 

TABLE SO* 



CAT Number Accession 

142696 1 AW45»79M136653M136656AW419381AA984358AM92073BE168^ 

BE011368 BE0 1 1 362 8E01 1 21 5 BE01 1 365 BE011363 

166714 1 AW265494 AA455904 AA195677 AW265432 AW991605 AA456370 

42361 j AA62B980AI126603 BE504035 

867B7_1 AA017656 AA017374 AA019761 

1226517J AW810204 AW810555 AW810196 AW810619 AW810507 

1366826J BE091833 BE091 874 BE091871 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham t. el at." refers to the publication entitled The DNA 

sequence of human dvomosome 22/ Dunham I. el a)., Nature (1 999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400965 


7770576 


Minus 


173043-173564 


401066 


8217436 


Plus 


71448-71574 


401335 


9884881 


Plus 


15736-16352 


401981 


4581193 


Minus 


124054-124209 


402239 


7690131 


Pius 


38175-38304,42133-42266 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402305 


7328724 


Plus 


4083241362 


402948 


9368458 


Minus 


143456-143626,143808-143935 


402994 


2996643 


Minus 


4727-4969 


403383 


9438267 


Minus 


119837-121197 


403477 


9958251 


Plus 


111834-11200B 


404394 


3135305 


Minus 


37121-37205,37491-37762,4105341140,413224159*4177341919 


404917 


7341851 


Plus 


4933049498 


405531 


9665194 


Plus 


35602-35803 


406387 


9256180 


Plus 


116229-116371,117512-117651 


406542 


7711499 


Plus 


117335-118473 
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TABLE 6A: Genes uprequfated in bladder cancer m 

Ptey: Unique Eos probesel Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene UUe 

R1 90th percentile of bladder tumor Als divided by the 90th percentile of normal urothellum biopsy Ais 

R2 90th percentile of bladder tumor Als divided by the 90th percentile of normal urothellum biopsy and normal bladder Als 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


439926 


AW014875 


Hs.137007 


ESTs 


11.31 


11.31 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


9.15 


9.15 


421110 


AJ250717 


Hs.1355 


cathepsinE 


9.07 


9.07 


417308 


H60720 


Hs.81892 


K1AA0101 gene product 


a so 


8.50 


418406 


X73501 


Hs.84905 


cytokeralin 20 


8.10 


8.10 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


7.98 


7.98 


433001 


AF217513 


Hs.279905 


clone HQ0310PRO0310p1 


7.67 


7.67 


408243 


Y00787 


Hs.624 


interteukin 8 


7.56 


7.56 


416065 


BE267931 


Hs.78995 


proliferating cell nuclear antigen 


7.17 


5.17 


425397 


J 04088 


Hs.156346 


topolsomerase(DNA) il alpha (170kD) 


7.17 


8.24 


414183 


AW957446 


Hs.301711 


ESTs 


7.14 


4.62 


41B007 


M13509 


Hs.83169 


matrix metaltoproteinase 1 (interstitial 


7.12 


7.12 


426125 


X87241 


Hs.166994 


FAT tumor suppressor (Drosophila) homolo 


6.90 


3.38 


427337 


Z46223 


Hs.176663 


Fc fragment of IgG, tow affinity 111b, r 


6.85 


4.98 


441633 


AW958544 


Hs.112242 


normal mucosa of esophagus specific 1 


6.42 


6.42 


438091 


AW373062 




nuclear receptor subfamily 1, group 1, m 


6.32 


6.32 


413063 


AL035737 


Hs.75184 


chilinase 3-iike 1 (cartilage glycoprote 


6.09 
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414219 


W20010 


Hs.75823 


ALU-fused gene from chromosome 1q 


5.93 


4.68 


405033 






C1002652*^i|544327|sp!Q04799]FMO5_RABlT 


5.84 


5.84 


413132 


NMJM36823 


Hs.75209 


protein kinase (cAMP-dependent, cataiytl 


5.79 


5.68 


428336 


AA503115 


Hs.183752 


microseminoproteln, beta- 


5.78 


4.57 


449230 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


5.69 


5.82 


423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


5.69 


7.30 


415511 


A1732617 


Hs.182352 


ESTs 


5.65 


5,65 


426028 


NM_001110 


Hs.172028 


a disintegrin and metalloproteinase doma 


5.60 


5.60 


421948 


L42583 


Hs.334309 


keratin 6A 


5.59 


14.20 


428651 


AF196478 


Hs.188401 


annexin A10 


5.55 


5.55 


424008 


R02740 


Hs.137555 


putative chemokine receptor; GTP-binding 


5.38 


5.59 


406687 


M31126 




matrix metalloproteinase 11 (stromefysln 


5.36 


5.34 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


5.35 


5.35 


408246 


N55669 


Hs.333823 


mitochondrial ribosoma) protein L1 3 


5.20 


3.50 


427678 


BE267756 


Hs.180312 


mitochondrial ribosoma! protein S16 


5.13 


4.10 


426158 


NM_001982 


Hs.199067 


v-ert>b2 avian erythroblastic leukemia v 


4.97 


3.71 


442315 


AA173992 


Hs.7956 


ESTs, Moderately simflar to ZN91.HUMAN Z 


4.90 


4.90 


418322 


AA284166 


Hs.84113 


cydln-dependent kinase inhibitor 3 (COK 


4.90 


4.77 


417720 


AA205625 


Hs.208067 


ESTs 


4.84 


7.34 


423979 


AF229181 


Hs.136644 


CS box-contaJning WO protein 


4.81 


4.81 


420961 


L40904 


Hs.100724 


peroxisome proliferative activated recep 


4.81 


4.43 


433470 


AW960564 




transmembrane 4 superfamfly member 1 


4.72 


4.72 


429138 


AB020657 


Hs.197298 


NSI-binding protein 


4.71 


4.71 


408063 


BE086548 


Hs.42346 


catcineurin-binding protein calsarcin-1 


4.71 


4.71 


452714 


AW770994 


Hs.30340 


hypothetical protein WAA1165 


4.69 


4.69 


442432 


BE093589 


Hs.38178 


hypothetical protein FU23468 


4.68 


4.68 


424834 


AK001432 ' 


Hs.153408 


Homo sapiens cDMA FU10570 fe. done NT 


4.65 


4.65 


446921 


AB012113 


Hs.16530 


small inducible cytokine subfamily A (Cy 


4.64 


4.64 


427490 


Z95152 


Hs.1 78695 


mitogen-activated protein kinase 13 


4.63 


5.12 


412490 


AW803564 


Hs.288350 


Homo sapiens cDNA: FU22528 fis, clone H 


4.61 


4.61 


418030 


BE207573 


Hs.83321 


neuromedin B 


4.60 


4.60 


401192 






Target Exon 


4.60 


4.29 


426761 


AI015709 


Hs.172089 


Homo sapiens mRNA; cONA DKFZp586l2022 (f 


4.59 


3.51 


452747 


BE153855 


Hs.61460 


Ig superfamily receptor LN1R 


4.58 


5.33 


449618 


AI076459 


Hs.15978 


WAA1272 protein 


4.58 


4.58 


423725 


AJ403108 


Hs.132127 


hypothetical protein LOC57822 


' 4.55 


4.55 


415701 


NM.003878 


Ha.78619 


gamma-ghitamyl hydrolase (conjugase, fol 


4.52 


4.70 


446742 


AA232119 


Hs.16085 


putative &protein coupled receptor 


4.49 


4.11 


419433 


AA814807 


HsJ395 


hypothetical protein FU23182 


4.48 


4.48 


412326 


R07566 


Hs.73817 


small inducible cytokine A3 (homologous 


4.47 


4.47 


427528 


AU077143 


Hs.179565 


minichromosome maintenance deficient (S. 


4.45 


4.45 


444371 


BE540274 


Hs.239 


forkheadboxMI 


4.44 


3.87 


444006 


BE395085 


Hs.10086 


type 1 transmembrane protein Fn14 


4.43 


3.63 


424308 


AW975531 


Hs.154443 


minichromosome maintenance deficient {S. 


4.43 


4.43 


401093 






C12000W:gi}6330167{dbj|BAA86477.1| (A 


4.40 


4.40 


447644 


AW861622 


Hs.108646 


Homo sapiens cDMA FU 14934 fis, done PL 


4.39 


4.39 


417933 


X02308 


Hs.82962 


thymidylate synthetase 


4.38 


4.35 


409461 


AA382169 


Hs.54483 


N-myc (and STAT) interactor 


4.36 


3.68 


401451 






NM_0Q4496*:Homo sapiens hepatocyte nude 


4.35 


4.35 


450746 


D82673 


Hs.278589 


genera) transcription factor II, 1 


4.35 


3.38 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



414683 


S78ZJ8 


HsJ6888 


hypolheGcal protein MGC12702 


4.34 


4.74 


434203 


BE262677 


Hs.283558 


hypothetical protein PR01855 


4.31 


4.31 


417615 


BE548641 


Hs.82314 


hypoxanthine phosphoribosyl transferase 1 


4.30 


4.30 


416815 


U41514 


Hs.80120 


UDP^acely!-alpha-D-galactosarnine:por/p 


4.30 


4.30 


440086 


NMJJ05402 


Hs,288757 


v-rat simian leukemia viral oncogene horn 


4.29 


4.29 


417715 


AW969587 


Hs.86366 


ESTs 


4.27 


7.45 


409757 


NM_001898 


Hs.123114 


cystalin SN 


4.24 


3.39 


412140 


AA219691 


Hs.73625 


RAB6 interacting, klnestrvlike (rabkines 


4.24 


4.24 


432842 


AW674093 


Hs.334822 


hypothetical protein M6C44B5 


4.21 


4.12 


445847 


T51454 


Hs.82845 


Homo sapiens cDNA: FU21930 fis, done H 


4.20 


4.20 


436856 


AI469355 


Hs.127310 


ESTs 


4.19 


4.19 


428450 


NM.014791 


Hs.184339 


K1AA01 75 gene product 


4.16 


4.90 


425234 


AW152225 


Kte.165909 


ESTs, Weakly similar to 138022 hypoM 


4.14 


4.14 


409231 


AA446644 


Hs.692 


GA733-2 antigen; epithelial gfycoprolefn 


4.14 


5.99 


426283 


NMJXJ3937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


4.12 


4.12 


446849 


AU076617 


Hs.16251 


cleavage end poryadenytallon specific fa 


4.12 


3.43 


400843 






NM_Q03105*:Homo sapiens swtfirwelated 


4.11 


5.51 


449722 


BE280074 


Hs.23960 


cycOnBI 


4.09 


4.09 


405506 






Target Exon 


4.09 


3.75 


420344 


BE463721 


Hs.9710l 


putative G protein-coupled receptor 


4.07 


4.07 


426997 


BE620738 


Hs.173125 


peptrdytpror/l isomerase F (cyclophilin 


4.05 


4.80 


456525 


AW468397 


Hs.100000 


SI 00 cafcium-binding protein A8 (catgran 


4.03 


7.64 


437150 


R51407 


Hs.77910 


IhyoVcxy^efoylglutaryt-ajen^ sy 


4.02 


4.02 


413794 


AF234532 


Hs.61638 


myosin X 


4.02 


4.02 


422511 


AU076442 


Hs.1 17938 


collagen, type XVII, alpha 1 


4.02 


4.72 


414020 


NM.002984 


Hs.75703 


small Inducible cytokine A4 (homologous 


4.01 


3.99 


416391 


A1878927 


HsJ9284 


mesoderm specific transcript (mouse) horn 


4.01 


4.01 


422809 


AK001379 


Hs.121028 


hypothetical protein FU10549 


4.00 


4.00 


400277 






Eos Control 


4.00 


3.47 


415791 


H09366 


Hs.78853 


uradl-DNA glycosylase 


3.99 


3.37 


412610 


X90908 


Hs.74126 


fatty acid binding protein 6, ileal (gas 


3.98 


4.77 


427557 


NM_002659 


Hs.179657 


plasminogen activator, urokinase recepto 


3,98 


3.95 


413753 


U17760 


Hs.75517 


iaminin, beta 3 {nicein (125kO), kalinin 


3.96 


6.95 


420859 


AW468397 


Hs.100000 


S100 calcium-binding protein A8 (calgran 


3.92 


5.04 


400409 


AF153341 




Homo sapiens winged helix/forkhead trans 


3.91 


3.88 


408988 


AL1 19844 


Hs.49476 


Homo sapiens clone TUA8 Cri-du-chat regi 


3.90 


190 


411678 


A1907114 


Hs.71465 


squalene epoxidase 


3.89 


189 


429113 


D28235 


Hs.196384 


prostaglandin-endoperoxlde synthase 2 (p 


3.87 


187 


428428 


AL037544 


Hs.184298 


cycfin-dependenl kinase 7 (homotog of Xe 


3.87 


3.87 


442932 


AA457211 


Hs.8858 


bromodomain adjacent to zinc finger doma 


3.85 


4.50 


429083 


Y09397 


Hs.227817 


BCL2-related protein A1 


3.85 


3.85 


439963 


AW247529 


Hs.6793 


platelet-activating factor acetylhydrola 


3.82 


177 


441362 


BE614410 


Hs.23044 


RAD51 (S. cerevisiae) homolog (E coil Re 


3.82 


3.82 


430589 


AJ002744 


Hs,246315 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


3.81 


3.81 


417771 


AA804698 


Hs.82547 


relincdc acid receptor responder (tazaro 


3.81 


162 


430259 


BE550182 


Hs.1 27826 


RalGEF-fike protein 3, mouse homolog 


3.80 


180 


447973 


AB011169 


Hs.20141 


similar to S. cerevisiae SSM4 


3.77 


3.77 


404875 






NM_022819*:Homo sapiens phosphdipase A2 


3.77 


146 


411299 


BE409857 


Hs.69499 


hypothetical protein 


3.76 


3.76 


418827 


BE327311 


Hs.47166 


HT021 


3.76 


176 


446839 


BE091926 


Hs.16244 


mitotic spindle coBed-cdl related prot 


3.75 


175 


407137 


T97307 




gb7e53h05.s1 Soares fetal liver spleen 


3.73 


173 


433376 


AI249361 


Hs.74122 


caspase 4, apoptosis-related cysteine pr 


3.71 


171 


400294 


N95796 


Hs.278695 


Homo sapiens prosteln mRNA, complete cds 


3.70 


3.45 


409518 


BE384836 


Hs.3454 


. KIAA1821 protein 


3.69 


3.69 


430024 


AI808780 


Hs.227730 


integrin, alpha 6 


3.69 


3.69 


426088 


AF038007 


Hs.166196 


ATPase, Class I, type 8B, member 1 


3.68 


168 


418478 


U3B945 


Hs.1 174 


cycfin-dependent kinase inhibitor 2A (me 


3.68 


166 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Orosophila) homolog 2 


3.67 


3.67 


413670 


AB000115 


Hs.75470 


hypothetical protein, expressed in osteo 


3.67 


141 


424840 


D79987 


Hs.1 53479 


extra spindle poles, S. cerevisiae, homo 


3.67 


188 


434263 


N34895 


Hs.44648 


ESTs 


3.65 


3.65 


438280 


AW015534 


Hs.217493 


annexinA2 


3.63 


136 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


3.63 


168 


408989 


AW361666 


Hs.49500 


KIAA0746 protein 


161 


166 


416640 


BE262478 


Hs.79404 


neuron-specific protein 


3.60 


4.22 


416926 


H03109 


Hs.108920 


HT018 protein 


3.59 


159 


414366 


W70171 


Hs.75939 


uridine monophosphate kinase 


3.59 


3.53 


402727 






NM_025065:Homo sapiens hypothetical prot 


3.58 


3.58 


419381 


AB023420 


Hs.90093 


heat shock 70kD protein 4 


156 


3.77 


416114 


A1695549 


Hs.183868 


glucuronidase, beta 


155 


155 


424941 


AA128376 


Hs.1 53884 


ATP binding protein associated with cell 


155 


3.55 


431958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


154 


4.63 


429238 


NM.002849 


Hs.198288 


protein tyrosine phosphatase, receptor t 


153 


3.92 


420159 


AI572490 


Hs.99785 


Homo sapiens CDMA: FU21245 Rs, done C 


3.51 


5.77 


400289 


X07820 


Hs.2258 


matrix rnsialioproteinase 10 (strometysin 


ISO 


150 


418203 


X54942 


Hs.83758 


CDC28 protein kinase 2 


150 


150 
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415220 


AA431B80 


Hs.181174 


ESTs, Weakly similar to T1 9201 hypolhsU 


3.50 


3,50 


428371 


AB012193 


Hs.183874 


culBn4A 


3.46 


3.46 


418663 


AK001100 


Hs.41690 


desmocoflin 3 


3.45 


4,74 


404977 






InsuMce growth factor 2 (somatomedi 


3.45 


3.89 


422863 


AW500087 


Hs.119014 


zinc finger protein 175 


3.44 


3.44 


434061 


AW024973 


Hs.283675 


NPO009 protein 


3.41 


5.64 


418113 


A1272141 


Hs.83484 


SRY (sex deternTining region Y>box 4 


3.41 


4.32 


431689 


AA3056B8 


Hs.267695 


UDP-GaI:betaGtcNAc beta 1,3-galactosyltr 


3.40 


3.40 


411943 


BE502436 


Hs.7962 


ESTs, Weakly similar to $44608 CQ2F5.6 p 


3.39 


4.27 


420005 


AW271106 


Hs.133294 


ESTs 


138 


3.40 


453450 


AW797627 


Hs.347459 


ADP-ribosyiation factor 6 


3.38 


3.87 


410315 


AI638871 


Hs.17625 


Homo sapiens cDNA: FU22524 fis, done H 


3.36 


3.36 


426839 


AI767756 


Hs.82302 


Homo sapiens cDNA FU14814 fis, clone NT 


3.35 


3.35 


437469 


AW753112 


Hs.15514 


hypothetical protein MGC3260 


3.35 


3.35 


407151 


H25836 


Hs.301527 


ESTs, Moderately simila/ to unknown (H.s 


3.34 


3.34 


42B157 


A1738719 


Hs.198427 


hexo3dnase2 


3.33 


3.73 


450293 


N36754 


Hs.171118 


hypothetical protein FU00026 


3.33 


3.33 


400750 






Target Exon 


3.33 


3.33 


450139 


AK001838 


Hs.296323 


serum/ghjcocortlcoid regulated kinase 


3.33 


3.33 


412636 


MM_004415 




desmoplakin(OPI, DPIl) 


3.30 


4.81 


447578 


AA912347 


Hs.136585 


ESTs, Weakly similar to JC5314 CDC28/cdc 


3.27 


3.38 


430315 


NH.004293 


Hs.239147 


guanine deaminase 


3.26 


4.30 


421594 


R45689 


Hs.21889 


Homo sapiens cDNA FU12978 lis, clone NT 


3.26 


3.41 


443030 


R6804B 


Hs.9238 


hypothetical protein FU23516 


3.19 


3.34 


436911 


AA142984 


Hs.5344 


adaptor-related protein complex 1, gamma 


3.17 


3.40 


440006 


AK00Q517 


Hs.5844 


hypothetical protein FU20510 


3.06 


3.52 


443171 


BE281128 


Hs.9030 


TONDU 


3.05 


3.83 


429343 


AK000785 


Hs.199480 


Homo sapiens, Similar to epsin 3, clone 


3.01 


3.53 


408380 


AF1 23050 


Hs.44532 


diubiquitin 


2.99 


4.11 


421508 


NMJM4833 


Hs.105115 


absent in melanoma 2 


2.99 


3.67 


439750 


AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN 


2.97 


4.55 


452046 


AB018345 


Hs.27657 


KIAA0802 protein 


2.95 


3.39 


451940 


AI735759 


Hs.52620 


integrin, beta 8 


2.93 


3.58 


407722 


BE252241 


Hs.38041 


pyridoxa) (pyrldoxine, vitamin B6) kinas 


2.90 


3.68 


422282 


AF019225 


Hs.1 14309 


apolipoprotein L 


Z89 


3.57 


402230 






Target Exon 


2.88 


5.36 


406685 


M18728 




gb:Human nonspecific crossre acting antig 


2.80 


5.80 


417880 


BE241595 


Hs.82B48 


selectin L (lymphocyte adhesion molecule 


2.79 


3.89 


447957 


NM_014821 


Hs.20126 


KIAA0317 gene product 


2.75 


3.45 


418004 


U37519 


Hs.87539 


aldehyde dehydrogenase 3 family, member 


2.75 


3.46 


417275 


X63578 


Hs.295449 


paralbumin 


2.73 


3.54 


431211 


M86849 


Hs.323733 


gap junction protein, beta 2, 26kO (conn 


2.72 


8.39 


401781 






Target Exon 


2.62 


4.15 


407242 


M18728 




gb:Human nonspecific crossreacting antig 


2.54 


5.96 


428423 


AU076517 


Hs.184276 


solute earner family 9 (sodium/hydrogen 


2.52 


4.27 


430200 


BE613337 


Hs.234896 


geminin 


2.52 


4.19 


451035 


AU076785 


Ks.430 


plasfoi 1 (1 isoform) 


2.51 


4.15 


443162 


T49951 


Hs.9029 


OKFZP434G032 protein 


2.48 


3.66 


441495 


AW294603 


Hs.127039 


ESTs 


2.45 


3.60 


449246 


AW411209 


Hs.23363 


hypothetical protein FU 10983 


2.45 


3.52 


401780 






NM_005557*:Homo sapiens keratin 16 (foca 


222 


4.49 


417079 


U65590 


Hs.81134 


intsneukm 1 receptor antagonist 


2.20 


3.55 


422168 


AA586894 


Hs.112408 


S1 00 calcium-binding protein A7 (psorias 


Z15 


6.08 


439394 


AA149250 


Hs.56105 


ESTs 


2.05 


3.95 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


1.79 


3.88 



TABlSiQ 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



412636 13165 1 NM 004415 AL031058 M77830 BE149760AW752599AW848723AW376697AW376817AW376699 AW848371 AW376782 AW848789 

AW361413 AW849074 AW997139 AW799304 AW799309 BE077020 BE077017 BE185187 AW997196 BE156621 BE179915 BE0O6561 
BE143155 AW890985 BE002107 AW103521 AA857316 AW383133 BE01 1378 AW1 70253 BE185750 AW886475 8E160433 J05211 BE082576 
BE082584 BE004047 AW607238 AW377700AW377699BE082526 BE082505 BE082507 BE08251 4 AW1 78000 AW1 77933 AI905935 
AW747877 AW748114 BE148516 AW26532B AW847678 AW847688 AW365151 AW365148 AW3651 53 AW365166 AW365175 AW365157 
AW365154 AW068840 BE005272 AW365145 BE001925 BE182166 BE144243 BE001923 AI951766 A1434518 BE1B4920 BE184933 AI284090 
BE184941 AW804674 BE184924 C04715 W39488 AW995615 BE184948 BE159646 AW606653 AA099891 AA131 128 AA337270 AA340777 
AW384371 AA852212 R58704 AW366566 AW364859 AA025851 AA025852 AA4551 00 AA719958 AW352220 AW996245 BE165351 BE073467 
AA377127 AW890264 AW609750 AW391912 AW849690 T87267 AW853812AA852213 W74149 BE009090 AA056401 H91011 AW368529 
AW390272 C18467 AW574920 N57176 AA026480 AW576767 H93284 AA026863 AW177787 AA026654 AW1 77786 BE092134 BE092137 
BE092136 AW177784AI022862BE091653AW376811 AW848592AA040018BE185331 BE182164 AA368564 AW951576 T29918AA131077 
W95048 W25458 AW205789 H90899 N29754 W32490 R20904 BE167181 BE167165 N84767 H27408 H30146 AI190590 C03378 A1554403 
AI205263 AA128470 AI392926 AF139065 AW370813 AW370827 AW798417 AW798780 AW798883 AW798569 R33557 AA149190 C03029 
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AW177783 AA0888S6 AW370829 AA247685 BE002273 A17608 16 A1439101 AW879451 AI700963 AA45 1923 A1340326 AI590975 T48793 
A1568096 AI142882 AA039975 A1470146 AA946936 BE067737 BE057786 W19287 AA644381 AA702424 AI417612 AJ306554 AI686869 
AI568892 AW190555 AI571075 AI220573 AA056527 AI471874 AI304772 AW51 7828 AI915596 AI627383 A1270345 AW021 347 AW1 66807 
AW105614 AI346078 AA552300 W95070 AI494069 A191 1 702 AA149191 AA026864 AI830049 A1887258 AW780435 AI910434 A1819984 
A185B282 AI078449 AI025932 AI850584 AI635B7B AA026047 AA703232 D1 2062 AW1 92085 AA658154 AW514597 AW591892 T87181 
AA782066 AW243815 AW1 50038 AW268383 AW004633 AI927207 AA782109 AW473233 A1804485 AW1 69216 A1572669 AA6021 82 
AW015480 AW771865 AI270Q27 AA951616 AA283207 AI076S62 AI498487 AJ346053 Af783914 H44405 AW7991 18 AA 1 28330 AA5 15500 
AA918281 W02156 AI905927 AA022701 W38382 R20795 T77861 AWB6087B 

433470 3672 1 AW960564 AA092457 T55890 D56120 T92525 AI8159S7 BE182608 BE1B2595 AW080238 M90657 AA347236 AW951686 AW176446 

AA304671 AW583735 T61714 AA316968 A1446815 AA343532 M083489 AA488005 W52095 W39480 N57402 D82638 W25540 W52847 
D82729 D5B990 BE619182 AA3151B8 AA308636 AA112474 W76162 AA088544 H52265 AA301631 H80982 AA113788 BE620997 AW651691 
AA343799 BE613669 BE547180 BE546656 F11933 AA376800 AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 AA190873 
AA180483 AA159546 F00242 AI940609 AI940602 AI189753 T97663 T661 10 AW062896 AW062910 AW062902 AI051622 A1828930 AA102452 
AI685095 AI81 9390 AA557597 AA383220 AI804422 A1633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW629109 AW513200 AA921353 AI677934 AI148698 Al 955858 AA173825 AA453027 A1027865 AW375542 AA454099 AA733014 AI591384 
R79300 R80023 AA8431 08 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 A1052270AW388117AW388111 AA699452 
AI242230 N47476 H38178 AA366621 AA1 13196 AA130023 H39740 T61629 AI885973 AW083671 AA179730 AA305757 AI285455 N83956 
AA216013 AA336155 AW999959 T97525 AA345349 T91 762 AA771981 AI285092 AI591386 BE392486 BE385852 AA682601 AI682884 
AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW75203B C06300 

438091 44964 1 AW373062T55662AI299190BE174210AW579001 H01811 W40186R67100AI923886AW952164 AA628440 AW898607 AW898616 

AA7091 26 AW898628 AWB98544 AA947932 AW898625 AW898622 A1275125 AI185720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 Al 288245 AJ 1 86932 052654 055017 D5Z715 D52477 D53933 054679 AI298739 AI146984 AJ922204 N98343 BE17421 3 AA845571 
AJ813854 AI214518 AI635262 AI139455 AJ707807 A/698085 AW884528 AI024768 AJ004723 AW087420 AJ565133 N94964 AI268939 
AW513280 AI061 126 A1435818 AI859106 AI360506 A1024767 AA513019 AA757598 X56196 AA902959 AI334784 AIB50794 AA010207 
AW890091 AW513771 AI951391 AI337671 T52499 AA890205 A1640908 H75966 AA463487 AA358688 AI961767 AI866295 AA780994 
A198591 3 BE174196 AA029094 AW592159 T55581 N79072 AI61 1201 AA910812 AI220713 AW149306 AI758412 AA045713 R79750 N76096 



TABLE 6C ___ - 

Pkey: Unique number corresponding to an Eos probeset 

Raf: Sequence source. The 7 digil numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al/ refers to the publication enfilled "The ONA 

sequence of human chromosome 22.' Dunham i. et aL, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400750 


8119067 


Plus 


198991-199168 199316-199548 


400843 


9188605 


Plus 


5863^970,765i7784,8892.9023,9673-9807,10534-10789,15254-15403,23827-23958 


401093 


8516137 


Minus 


22335-23166 


401192 


9719502 


Minus 


69559-70101 


401451 


663406B 


Minus 


119926-121272 


401780 


7249190 


Minus 


28397-2861 7,2892029045,291 35-29296.2941 1-29567.29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656,83740^3901,84237-84393,84955-85037,86290-66814 


402230 


9966312 


Minus 


29782-29932 


402727 


9211324 


Pius 


54596-54777 


404875 


9801324 


Plus 


96588-96732,97722-97831 


404977 


3738341 


Minus 


43081-43229 


405033 


. 7107731 


Minus 


142358-142546 


405506 


6466489 


Plus 


8001 4-80401 ,80593-81 1 25 
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TABLE 7 A: Genes downrequlated in bladder cancer 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 90th percentile of normal uroMum biopsy Ats ajvtded by 75(h percentile of bladder tumor Ais 

R2 90th percentile of normal urothellum biopsy and normal bladder Als divided by the 90th percentile bladder tumor Als 

Unigene Title 

C21000152:giI6226483|sp|Q52118|YM03_ERWS 
cell division cycle 27 
ESTs, Weakly similar to PC4259 ferritin 
Wolf-Hirschhom syndrome candidatB 1-Uk 
Target Exon 

alcohol dehydrogenase 1B (class I), beta 
gb:nc05h02.s1 NCI_CGAP_Pr1 Homo sapiens 
ESTs, Highly similar to T42626 secreted 
WNT1 inducible signaling pathway protein 
ESTs 

ESTs, Moderately similar to ALU7 JWMAN A 
ESTs 

biphenyl hydrolase-like (serine hydrolas 
Homo sapiens mRNA; cDNA DKFZp585M2022 (f 
ESTs 

neural ceil adhesion molecule 1 
ESTs, WeaWy similar to ALU1.HUMAN ALU S 
ENSP00000217725*:Lamlnin alpha-1 chain p 
glycetonephosphate O-acyl transferase 
platelet-derived growth factor receptor, 
ESTs 

hypothetical protein MGC10848 
protein kinase (cAMP-dependenl, catalyti 
Homo sapiens cDNA: FU231 76 fis. clone L 
ESTs 

Meis (mouse) homotog 2 
monoamine oxidase B 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586B21 1 (fr 
chondroitin sulfate proteoglycan 4 (mela 
peripheral myelin protein 22 
CDC20 (cell division cycle 20, S. cerevi 
RAB23, member RAS oncogene family 
ESTs 
ESTs 

pory(A)-binding protein, nuclear 1 
BCL2-associated athanogene 2 
ESTs 
ESTs 
ESTs 
ESTs 

C16000922:gi|7499103|pirl|T20903 hypothe 
ESTs 

K1AA0443 gene product 
ESTs 

Homo sapiens cDNA FU13329 fis, done OV 
gb.7e8lflO.sl Scares fetal liver spleen 
ESTs, Weakly similar to unnamed protein 
Target Exon 
K1AA1628 protein 

solute carrier family 25 (mitochondrial 
transforming growth factor beta-stimulat 
sodium calcium exchanger 
G protein-coupled receptor kinase 5 
cyclin-dependent kinase inhibitor 1 C (p5 



Pkey 


ExAccn 


UnigenelD 


403010 






426796 


S78234 


Hs.172405 


416225 


AA577730 


Hs.188684 


459006 


AW298631 


Hs.27721 


404917 






426488 


X03350 


Hs.4 


419543 


AA244170 




453180 


N46243 


Hs.1 10373 


428957 


NM_003881 


Hs.194679 


451529 


A1917901 


1 1. AAA0 J J 

Hs.208641 


417076 


AW973454 


Hs.238442 


425438 


T62216 


Hs.270840 


450515 


AW304226 




432873 


AW837268 


Hs.279639 


452123 


AI267615 


Hs.38022 


424378 


W28020 


Hs.167988 


437601 


AA761546 


Hs.248844 


402096 






439553 


AI018768 


Hs.12482 


412810 


M21574 


Hs.74615 


458651 


AW6124B1 


Hs.104105 


414033 


AL079707 


Hs.207443 


433572 


AL046B59 


Hs.3407 


413305 


NM.000426 


Hs. 3235 11 


420412 


AW976674 


Hs.1 251 03 


421406 


AF1 79897 


Hs.104105 


417446 


AL1 18671 


Hs.82163 


452886 


AI478250 


Hs.1 3751 


446808 


AA703226 


Hs.16193 


443105 


X96753 


Hs.9004 


421348 


N/194048 


Hs.103724 


433070 


N75346 


Hs.306121 


420059 


AF161486 


Hs.94769 


408491 


A1088063 


Hs.7882 


447384 


AI377221 


Hs.40528 


421998 


R74441 


Hs.1 17176 


409619 


AK001015 


Hs.55220 


444795 


AI193356 


Hs.1 60316 


408495 


W68796 


Hs.237731 


417124 


BE122762 


Hs.25338 


443998 


Al 620661 


Hs.295276 


406303 






422994 


AW891802 


Hs.296276 


422195 


AB 007903 


Hs.1 13082 


452877 


AI250789 


Hs.32478 


452487 


AW207659 


Hs.6630 


417159 


R01761 




445607 


AA488107 


Hs.30156 


406274 






410611 


AW954134 


Hs.20924 


426495 


NM_001151 


Hs.2043 


422292 


A1815733 


Hs.114360 


413040 


AA193338 


Hs.12321 


429623 


NM_005308 


Hs.211569 


456607 


A1660190 


Hs.106070 


429143 


AA333327 


Hs.197335 


400288 


X06256 


Hs.149609 


442498 


U54617 


Hs.8364 


414449 


AA557660 


Hs.76152 


412014 


AI620650 


Hs.43761 


425100 


AF051850 


Hs.154567 


432094 


A1658580 


Hs.61426 


427818 


AW511222 


Hs.193765 



integrin, alpha 5 (fibronectin receptor, 
Homo sapiens pyruvate dehydrogenase kina 
decorin 

ESTs. Weakly similar to A46010 X-Jinked 
supervfUln 

Homo sapiens mesenchymal stem cell prole 
ESTs 



R1 


R2 


4.86 


2.49 


4.28 


2.48 


4.04 


2.07 


3.62 


2.66 


3.78 


ZOO 


2.64 


1.79 


2.63 


3.42 


2.32 


3.24 


2.20 


3.80 


2.18 


3.69 


203 


3.03 


2.00 


5.17 


1.89 


246 


1.70 


2.79 


1.69 


2.46 


1.65 


4.67 


1.50 


3.34 


1.48 


3.02 


1.47 


3.22 


1.46 


2.30 


1.39 


2.89 


1.36 


2.80 


1.35 


3.49 


1.34 


293 


1.32 


5.13 


1.31 


4.07 


1.27 


2% 


1.26 


1.95 


1.25 


3.44 


1.24 


207 


1.24 


263 


1.23 


280 


1.22 


3.43 


1.20 


6.01 


1.00 


7.92 


1.00 


7.38 


1.00 


6.40 


1.00 


5.53 


1.00 


5.05 


1.00 


4.73 


1.00 


A 10 


1.00 


4.37 


1.00 


4.37 


1.00 


4.35 


1.00 


3.90 


1.00 


3.90 


1.00 


3.82 


1.00 


3.62 


1.00 


3.59 


1.00 


3.06 


1.00 


289 


1.00 


261 


1.00 


251 


1.00 


205 


1.00 


201 


0.97 


245 


0.90 


247 


0.88 


5.08 


0.88 


3.13 


0.78 


1.88 


0.70 


3.90 


0.68 


3.41 


0.63 


3.75 



IfiBLS ZB 



Pkey: Unique Eos probeset Identifier number 
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CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

417159 1653899.1 R01761 R01760 N49787 

419543 185745.1 AA244170 AIO18O07 AA244355 

450515 63710 1 AW304226 AW008420AA349212H15015 AA317021 AI829484 H25661 H8l744Ai906147AA837938AW167766AW603578AW842369 

BE439926 AA902417 AW235409 AA010062 AW069319 AI280242 AW672925 HD6B48 H05608 R51905 R45023 AW675471 H28475 AI08B597 
AI19781 5 AI825355 N99134 AI075956 AI4701 22 AA449985 AW662833 AA860423 AA913342 T23825 A1394207 A1310319 T32467 A3 589870 
AI682293 AI810633 BE223045 H14620 AA626645 AA876023 T33571 AA953982 AI138631 H1501S AI304356 AA983831 A1350990 AI143993 
AI708171 AA526961 H26247 W38485 AA847598 H81745 AW855486 BE299605 A1079409 AI278050 A1223168 AI86Q904 AW02541 5 AI339003 
AA393692 A1354302 A1492838 N80194 A101S651 N34543 BE295397 A1085154 W24135 



TABLE 7C , , 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22/ Dunham I. et al. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
N'Lpcsflion: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


402096 


8117697 


Minus 


24993-25186 


403010 


3132346 


Pius 


78385-79052 


404917 


7341851 


Pius 


49330-49498 


406274 


7543787 


Plus 


932-1123 


406303 


8575868 


Plus 


173622-173786 
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TABLE 6A: Genes predictive of bladder cancer progression 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unpens TIDe: Unpens gene title 

R1 80lh percentile of Ta or T1 tumor Als from patients who upstaged divided by 80th percentile of Ta or T1 tumor AIs from patients who did not upstage 

R2 median of Ta or T1 tumor Als from patients who upstaged divided by the median of TaorTI tumor Als from patients who did not upstage 

Pkey ExAccn UnigenelD Unigene Title 

413324 V00571 Hs.75294 corticotropin releasing hormone 

437802 AJ475995 Hs.122910 ESTs 

444444 AI149332 Hs.14855 ESTs 

445033 AV552402 Hs.72901 mucin 13, epithelial transmembrane 

417771 AA804698 Hs.82547 retinoic acid receptor responder (tazaro 

449618 AI076459 Hs.15978 WAA1272 protein 

407242 M18728 gb:Human nonspecific crossreacting antig 

418318 U47732 Hs.84072 transmembrane 4 superfamily member 3 

423441 R68649 Hs.278359 absent in melanoma 1 like 

405033 C1002652^i}544327lsp|Q04799lFMO5^RABIT 

423024 AA593731 Hs,325823 ESTs, Moderately similar to ALU5.HUMAN A 

425118 AU076611 Hs. 164672 methylene tetrahydrofolate dehydrogenase 

437928 NMJJ05476 Hs.5920 UDP-r4-acetylgtucosamine-2^pimerase^-ac 

446584 U53445 Hs.15432 downregulated in ovarian cancer 1 

436608 AA628980 down syndrome critical region protein DS 

404440 NMJ)21 048:Homo sapiens melanoma antigen, 

435602 AF217515 Hs.283532 uncharacterized bone marrow protein BM03 

424098 AF077374 Hs. 139322 small proline-rich protein 3 

4375S3 AI829935 Hs.130497 ESTs, Weakly similar to MAT8.HUMAN CHLOR 

428036 AW068302 Homo sapiens mRNA for caldesmon, 3* UTR 

446839 BE091926 Hs.16244 mitotic spindle coiled-coil related prot 

448479 H96115 Hs.21293 UDP^-acteylglucosarrune pyr^ 

412059 AA317982 Hs.249721 ESTs, Moderately similar to PC4259 ferri 

401241 AB028989 mi tog en -activated protein kinase 8 inter 

408741 M73720 Hs.646 carboxypeptidase A3 {mast cell) 

415989 AI267700 ESTs 

431070 AW408164 Hs.249184 transcription factor 19 (SC1) 

452140 AB007928 Hs.28169 KIAA0459 protein 

443162 T49951 Hs.9029 DKFZP434G032 protein 

435904 AF261655 Hs.8910 1,2ndpha-rnannosidasetC 

407379 AA332127 Hs.325804 transcription factor 17 

442712 BE465168 Hs.131011 ESTs 

411678 A1907114 Hs.71465 squalene epoxldase 

406791 A1220684 Hs.347939 hemoglobin, alpha 2 

431805 NMJ)14053 Hs.270594 FLVCR protein 

438414 AA806794 Hs.131511 ESTs 

413924 AL119964 Hs.75616 seladin-1 

437679 NMJJ14214 Hs.5753 inositol(myo).1{or 4)-monophosphatase 2 

44591 1 AI965987 Hs.145645 ESTs. Moderately similar to ALU1.HUMAN A 

408349 BE546947 Hs.44276 homeoboxCIO 

422545 X02761 Hs.287820 fibronecBnl 

406643 N77976 Hs.347939 hemoglobin, alpha 2 

407228 M25079 Hs.1 55376 hemoglobin, beta 

449644 AW960707 Hs.148324 ESTs 

402305 C19000735':gil4508027|ref|NP.003414.1| z 

427683 BE545490 Hs.15053 Homo sapiens HCMOGT-1 mRNA for sperm ant 

441690 R81733 Hs.33106 ESTs 

434487 AF143867 Hs.337588 ESTs, Moderately similar to S65657 alpha 

403362 NM_001 61 5 # :Homo sapiens acSn, gamma 2, 

445496 AB007860 Hs.12802 development and differentiation enhandn 

425234 AW152225 Hs. 165909 ESTs, WeaWy simfiar to 138022 hypolhefi 

402366 AV648601 apolipoprotein B (including Ag(x) anlige 

427254 AL121523 Hs.97774 ESTs 

414533 AA149060 Hs.296100 ESTs 

430157 BE348706 Hs.278543 ESTs 

413433 NM_003199 ' Hs.326198 transcription factor 4 

410532 T53088 Hs.155376 hemoglobin, beta 

405779 NM_005367:Homo sapiens melanoma anfigen, 

450455 AL1 17424 Hs.25035 chloride intracellular channeU 

414081 AW969976 Hs.279009 matrix Gla protein 

414761 AU077228 Hs.77256 enhancer of zeste (Drosophfla) homolog 2 

415062 H45100 Hs.49753 uveal autoantigen with coited coil domai 

406317 C2002658*:gll6625694|gblAAF19354.1IAF185 

453259 R93125 Hs.124187 ESTs 

445937 AI452943 Hs.321231 UDP^betaGlcNAcbetal^galactosytt 

434370 AF130988 Hs.58346 ectodysplasin 1. anh'rfroUc receptor 

416058 AW161552 Hs.83381 guanine nudeotklB binding protain 1 1 

174 
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8.30 


4.18 
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Z26 


1.13 


3.27 
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170 


133 


3.56 
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2.09 


0.91 
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133 
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1.81 
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102 


1.26 
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1.23 


1.60 


1.00 


1.73 


1.50 


2.44 


195 
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1.00 


2.56 


1.20 


2.10 


1.72 


2.54 


2.72 


1.12 


3.11 


1.69 


1.38 


1.92 


105 


1.04 


115 


1.69 


105 


2.27 


126 


1.42 


174 


1.60 


105 


1.77 


3.02 


1.57 


1.35 


1.81 


1.50 


1.90 


3.19 


2.25 


1.49 


1.08 


125 


1.80 
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113 


1.63 


2.33 


122 


1.12 


160 


1.83 


105 
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2.44 


1.00 


1.70 


106 


2.54 
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2.26 


1.41 


1.52 


1.56 


142 


1.22 


2.48 
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1.81 


1.53 


1.65 


108 


1.62 
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1.68 
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1.08 
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1.76 
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1.12 
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1.00 
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AAR1017R 


He 17QR47 


Homo sapiens cDNA FU 12195 fis, done MA 


1.49 


2.01 


427055 


AI301740 


Hs.173381 


dihydropyrimldinase-Gke 2 


1.11 


^58 


454244 


R51604 


Hs.300842 


KIAA1608 protein 


1.00 


102 


429503 


AA394183 


Hs.26873 


ESTs 


2.58 


4.08 


422940 


BE077458 




gb:RC1^T0606^)9050(M)154)04 BT0606 Homo 


3.48 


2.46 


407949 


W21874 


Hs.247057 


ESTs, WeaWy similar to 2109260A B cell 


3.13 


1.48 


437312 


AA809350 


Hs.246180 


ESTs 


1.10 


105 


449426 


T92251 


Hs.198882 


ESTs 


1.22 


108 


447620 


AW290951 


Hs.224965 


ESTs 


1.80 


118 


444700 


NMJJ03645 


Hs.11729 


faUy-acid- Coenzyme A ligase, very long- 


1.72 


181 


436258 


AW867491 


Hs.107125 


plasmatemma vesicle associated protein 


1.99 


1.80 


415712 


AW249188 


Hs.169577 


Homo sapiens cDNA FU14743 fis, clone NT 


1.66 


198 


432877 


AW974111 


Hs.292477 


ESTs 


1.63 
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412085 


AW891667 




gb:CM3-NT0089-1 10500-179*09 NT0089 Homo 
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431421 AW96911B Hs.108144 ESTs, WeaWy similar b unnamed protein 

409714 AW367812 Hs.199961 ESTs, Weakly similar b ALU7_HUMAN ALU S 

423013 AW875443 Hs, 22209 secreted modular caJdum-binding protein 

422663 AW500087 Hs.119014 zinc finger protein 175 

5 439737 AI751438 Hs.41271 Homo sapiens mRNA full length Insert cDN 

41 31 96 AA127386 gb:zn90d09.r1 Stratagene lung carcinoma 

439349 AI660898 Hs. 195602 ESTs 

443005 AI027184 Hs.200918 ESTs 

424762 AL 119442 Hs.1 83684 eukaryotic translation initiation factor 

10 427373 AB007972 Hs.1 30760 myosin phosphatase, target subunit 2 

413916 N49813 Hs.75615 apolipoprotein C-ll 

41B332 R34976 Hs.78293 ESTs 

426552 BE297660 Hs.170328 moesln 

456583 AF179897 Hs.104105 Mels (mouse) homolog 2 

15 447214 A1367288 Hs.273621 Homo sapiens cDNA: FU21350 fe, clone C 

449254 W26908 Hs.1 72762 ESTs 

443651 W22152 Hs.282929 ESTs 

421021 AA808018 Hs.109302 ESTs 

419741 NMJJ07019 Hs.93002 ubiquitin carrier protein E2-C 

20 432027 AL096678 Hs.272353 KIAA0957 protein 

452688 AA721140 Hs.49930 ESTs, Weakly similar to putative p150 [H 

417042 C75563 Rs.1 13029 ribosomal protein S25 

443574 U83993 Hs.321709 purinerglc receptor P2X, Egand-gated io 

429372 AA451859 Hs.99253 ESTs 

25 424290 AA33B396 gb:EST43386 Fetal brain I Homo sapiens c 

428518 AW969656 gb:EST381733 MAGE resequences, MAGK Homo 

455649 BE065051 gb:RC1-BT0313-1 1 0500-01 7-c04 BT0313 Homo 

414665 AA160873 Hs.332053 serum amyloid A1 

418298 AA256014 Hs.86682 Homo sapiens cDNA: FU21578 lis, clone C 

30 429655 U48959 Hs.211582 myosin, light polypeptide kinase 

433924 AA618304 Hs.258785 ESTs 

452683 AI089575 Hs.3071 progesterone membrane binding protein 

439437 AI207788 Hs.343628 siahyltransferase 4B (beta^atactosidase 

432314 AA533447 Hs.312989 ESTs 

3 5 400881 NMJ)2508O:Homo sapiens hypothetical prot 

426477 AA379464 gb:EST92386 Skin tumor I Homo sapiens cO 

454741 BE154396 gb:CM24fT0342-0912994J504)05HT0342Homo 

423977 AA333232 gb:EST37283 Embryo, 8 week I Homo sapien 

450396 AU077002 Hs,24950 regulator of G-protein signainng 5 

40 431842 NM 005764 Hs.271473 epithelial protein up-regulated in card 

415157 D63257 gb:HUM514B08B Oontech human placenta po 

418236 AW994005 Hs.337534 ESTs 

454390 AB020713 Hs.56966 KIAA0906 protein 

436143 AA705245 Hs.192189 ESTs 

45 436251 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 

450735 AI732321 SRY (sex determining region Y)-box 4 

420136 AW801090 Hs.1 95851 actin, alpha 2, smooth muscle, aorta 

447100 AI361801 Hs.16713Q hypothetical protein 

453577 AL043049 gb:OKF2p434A1 523 j\ 434 (synonym: htes3) 

50 408522 A1541214 Hs.46320 Small proline-rich protein SPRK [human. 

412632 AL1 20379 Hs.74294 aldehyde dehydrogenase 7 family, member 

423291 NM.004129 Hs.126590 guanylate cyclase 1. soluble, beta 2 

4561 72 R99050 gb:yq65c02.r1 Scares fetal liver spleen 

452123 AI267615 Hs.38022 ESTs 

55 433900 AA721668 Hs.257761 ESTs 

408436 R31954 Hs.7885 phosphatidyEinositol binding clathrin as 

417123 BE326521 Hs.159450 ESTs 

436023 T81819 Hs.302251 ESTs 

454150 AA131693 Hs.154088 hypothetical protein FU22756 

60 444094 AI695764 Hs.202394 ESTs 

429176 AW975021 Hs.193800 ESTs 

422259 AA307584 gb:EST1 78498 Colon carcinoma (HCCJ ceil 

451164 AA015912 Hs.60659 ESTs, Weakly similar b T46471 hypothetl 

417501 AL041219 Hs.82222 sema domain, immunoglobulin domain (Ig), 

65 455642 BE063965 • gb:OV3^T0295.140200485-h01 BT0296 Homo 

443387 8E139135 Hs.254629 ESTs 

420995 AA282495 Hs.89014 ESTs 

407329 AA576061 Hs.269834 ESTs, Weakly similar to ALU D_HUMAN ill! 

438797 C16161 Hs.2B304O hypothetical protein PR02543 

70 443357 AW016773 low molecular mass ubiquinone-binding pr 

412656 AF006011 Hs.74375 dishevelled 1 (homologous to DrosopWIa 

427377 AU077029 Hs.177543 antigen Identified by monoclonal anffbod 

412200 R08110 Hs.187462 ESTs, Weakly similar b I38022 hypothetl 

432586 AA568548 ESTs 

75 411590 T96183 gb:ye09f07.s1 Stratagene lung (937210) H 

422672 X12784 Hs.1 191 29 collagen, type IV, alpha 1 

420256 U84722 Hs.76206 cadherln 5, type 2, VE-cadherin (vascula 
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419900 AI469960 Hs.170698 ESTs 

410805 AW804742 Hs.84264 acidic protein rich in leucines 

452560 BE077084 Hs.99969 ESTs 

446429 D17408 Hs.21223 calponin t, basic, smooth muscle 

5 424436 AW818426 Hs.4953 golgi autoanfigen, golgtn subfamily a, 3 

447400 AK00O322 Hs, 18457 hypothetical protein FU20315 

422522 AI023428 Hs.34549 ESTs, Highly similar to S94541 1 clone 4 

443696 AW607444 Hs.134622 ESTs 

436094 AI798701 Hs.222222 ESTs 

10 420168 AF217508 Hs.95594 serine carboxypeptidase viteiiogenic-lix 

430325 AF004562 Hs.239356 syntaxin binding protein 1 

439022 AA356599 Hs.173904 ESTs 

420563 AA278327 Hs.136237 ESTs, Moderately similar to Y140_HUMAN H 

429494 AA769365 Hs.126058 ESTs 

15 420689 H79979 Hs.88678 ESTs 

448988 Y09763 Hs.22785 gamma-aminobutyric acid (GABA) A recepto 

439943 AW083789 Hs.124620 ESTs 

442300 AI765908 Hs.129166 ESTs 

449614 A1989490 Hs.197703 ESTs 

20 444363 AI142827 Hs,143656 ESTs 

424479 AF064238 Hs.149098 smoothelin 

437321 AA768966 Hs.292026 ESTs, Weakly similar to 2109260A B ceil 

431926 AW972724 gb:EST38481 6 MAGE resequences, MAGL Homo 

433640 AW390125 Hs.240443 Homo sapiens cDNA: FU23538 fis, done L 

25 415901 H08396 Hs.76118 ubiquitin carboxyMermina) esterase LI 

437199 ALU0175 Hs.306337 Homo sapiens mRNA; cDNA OKFZp564H061 6 {/ 

457450 AW294163 Hs.146127 ESTs 

456678 AF141305 Hs.173736 ancient ubiquitous protein 1 

451079 A1827988 Hs.240728 ESTs, Moderately similar to PC4259 fern 

30 405944 Target Exon 

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 

446682 AW205632 H&211198 ESTs 

431380 AW610282 Hs.291003 ESTs 

442027 A1652926 Hs.128395 ESTs 

35 423578 AW960454 Hs.222830 ESTs 

441495 AW294603 Hs.127039 ESTs 

417900 BE250127 Hs.82906 COC20 (cell division cycle 20, S. ceravi 

443949 AW827419 Hs.235070 ESTs 

440495 AA887212 Hs.14161 hypothetical protein DKFZp434l 1930 

40 449948 R19156 Hs.20798 ESTs 

439564 W77911 Hs,1100Q6 ESTs 

423225 AA852604 Hs.125359 Thy-1 cell surface antigen 

436139 AA765766 Hs.120936 ESTs 

456968 AI174861 Hs,190623 ESTs 

45 437191 fVM_O06846 Ks.331555 serine protease inhibitor, Kazaftype, 5 

411652 AW855393 gb;CM3-CT0275-1 91 099-024^10 a0275 Homo 

420732 AA789133 Hs.88650 ESTs 

409291 AW373472 gb:RC3-BT0523-181 2994)1 1-d12 BT0523Homo 

424415 MM 001975 Hs.146580 enolase 2, (gamma, neuronal) 

50 424927 AW973666 Hs.153850 hypothetical protein C3210Z4 

450946 AA374569 Hs.127698 ESTs, Moderately similar to 21 0926QA Be 

428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 

415361 F06724 gb:HSC1lG021 normalized infant brain cDN 

406490 C5001926:Qi|751 1 572|pirl|T42245 probable 

55 410855 X97795 Hs.66718 RAD54 (S.cerevisiaeHike 

440010 AA534930 Hs.127236 hypothetical protein FU12B79 

429508 AW369620 Hs.33944 ESTs, Weakly similar to ALU1_HUMAN ALU S 

426340 Z97989 Hs.169370 FYN oncogene related to SRC, FGR, YES 

416889 AW250318 Hs.80395 mal, T-c»li differentiation protein 

60 451870 A1820991 Hs.8377 ESTs 

444091 AV647924 Hs.282376 ESTs 

410793 AW581906 Ks.66392 intefsectin J (SH3 domain protein) 

452222 AW806287 Hs.21432 SEX gene 

43301 0 AW97001 8 gb:EST382097 MAGE resequences, MAGK Homo 

65 432674 AA641092 Hs.257339 ESTs, Weakly similar to I38022 hypotheti 

438855 AW946278 Hs.6441 Homo sapiens mRNA; cDNA OKFZp586J021 (fr 

448718 AA220235 Hs.153959 hypothetical protein MGC1 5436 

402685 Target Exon 

424528 AW073971 Hs.238954 ESTs, Weakly similar to KIAA1204 protein 

70 422068 A1807519 Hs.104520 Homo sapiens cDNA FU13694 fis, clone PL 

451225 AI433694 Hs.293608 ESTs 

441078 AI453268 Hs.323409 Homo sapiens cONAFU 1411 3 fis, done MA 

409406 H83092 Hs.49605 ESTs 

422297 AW961290 p30 DBC protein 

75 408711 AW376081 Hs.63335 ESTs, Moderately simflar to A46010 X-lin 

426596 AW363332 Hs.171844 Homo sapiens cDNA: FU22296 fis, done H 

417324 AW265494 ESTs 
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408283 


8E141579 




6b:QV2-HT0083-071299-018-bQ5 HT0083 Homo 


415166 


NML003652 


Hs.78068 


carboxypeptkJase Z 


406300 






Target Exon 


411880 


AW872477 




gb:hm30f03jc1 NCLCGAP_Thy4 Homo sapiens 


422287 


F16365 


Hs.1 14346 


cytochrome c oxidase subuntt Vila polype 


422567 


AF111178 


Hs.1 18407 


gfyptcan 6 


436855 


AA732624 


Hs.165852 


ESTs 


403536 






Target Exon 


447733 


AF1 57482 


Hs.1 9400 


MAD2 (mitotic arrest deficient, yeast, h 


417117 


N46778 




gb:yy52b0Zr1 Soares_mu!t]p!e_sclerosis_ 


411690 


AA669253 


Hs.1 36075 


RNA, U2 small nuclear 


443243 


AI452496 


Hs.132056 


ESTs 


423074 


AL1 09963 




FSH primary response (LRPR1, rat) homoto 


408916 


AW295232 


Hs.429 


ATP synthase, H transporting, mitochondr 


449799 


AI143466 


Hs.125060 


ESTs 


415378 


T16964 




gb:NIB2079-5R Normalized infant brain, B 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


434959 


AW974949 


Hs.186564 


ESTs, Weakly similar to I36022 hypotheti 


416311 


D80529 




gb:HUM081H05B Human fetal brain (TFufrwa 


444614 


R44284 


Hs.2730 


heterogeneous nuclear ribonucteoproteln 


456206 


NM.006895 


Hs.81182 


histamine N-methyttransferase 


410583 


AW7702B0 

nil / # vtuu 


Hs.36258 


ESTs. Moderately similar to JC5238 galac 


430410 


AF099144 


Hs.334455 


tryptase beta 1 


408139 


AA451966 




RAB9-Bke protain 


432621 


AJ298501 


Hs.1 2807 


ESTs, Weakly similar to T46428 hypotheti 


441584 


AW148329 


Hs. 175208 


ESTs 


445940 


060438 


Hs.34779 


ESTs 


453022 


AA031499 


Hs.1 18489 


ESTs 


444008 


BE544855 


Hs.236572 


ESTs, Weakly similar to SFR4.HUMAN SPUC 


442994 


A1026718 


Hs.1 6954 


ESTs 


402085 






C18000504*:gi|2627436jgb}AAB86683.1| (AF 


411918 


AW876354 




gb:PM4-PT001 9-141 299-009-F08 PT0019 Homo 


455506 


AW976165 




gb:EST388274 MAGE resequences, MAGN Homo 


426106 


AI678765 


Hs.21812 


ESTs 


425131 


BE252230 


Hs.99163 


ESTs 


440325 


NMJJ03812 


Hs.7164 


a dislntegrin and melaflloproteinase doma 


420447 


AA687306 


Hs.88448 


ESTs 


428055 


AA420564 


Hs.1 01760 


ESTs 


422110 


AI376736 


Hs.1 11779 


secreted protein, acidic, cysteine-rich 


438581 


AW977766 


Hs.292133 


ESTs, Moderately similar to 178885 serin 


403290 






C1 000101 r:gi|4758212lreflNPJ)0441 1 ,1| d 


408175 


W29089 


Hs.1 9066 - 


hypothetical protein DKFZp66702416 


432390 


AA936177 


Hs.274460 


olfactory receptor, family 5, subfamily 


443441 


AW291196 


Hs.92195 


ESTs 


419925 


AA159850 


Hs.93765 


lipoma HMGIC fusion partner 


445256 


AI858635 


Hs.1 44763 


ESTs 


456381 


AA236606 




gb:zr99b10.r1 NCLCGAP.GCB1 Homo sapiens 


422433 


AA310560 


Hs.153746 


hypothetical protein FU22490 


432529 


A1989507 


Hs.162245 


ESTs 


424951 


AW964082 




gb:EST376155 MAGE resequences, MAGH Homo 


420785 


H89633 


Hs.191346 


ESTs 


411347 


AW838126 




gb:QV2-LT0051-240300-097-f01 LT0051 Homo 


438742 


AW204126 


Hs.1 96543 


ESTs 


414900 


AW452420 


Hs.248678 


ESTs 


443284 


AI369813 


Hs.64783 


ESTs, Weakly similar to T42705 hypotheti 


402049 






Target Exon 


429400 


AW604940 


Hs.201668 


transcription factor 20 (AR1) 


423916 


AW993496 


Hs.1 7235 


Homo sapiens clone TCCC1AQ0176 mRNA sequ 


432495 


AW973537 


Hs.1 86734 


ESTs, WeaWy similar to 161746 pheramone 


414840 


R27319 


Hs.23823 


hairy/enhancer-of-split related with YRP 


428711 


R46414 


Hs 56828 


trinucleotide reneat ccntainina 5 


448609 


AW1 39420 


Hs.7972 


K1AA0871 protein 


443859 


NM 013409 


Hs.9914 


fbOtstelin 


411141 


AW819561 




gb:RC5-ST0293-140200-013-G04 ST0293 Homo 


440116 


AI798851 


Ks.266959 


hemoglobin, gamma G 


417944 


AU077196 


Hs.82985 


collagen, type V, alpha 2 


429640 


U83508 


Hs.2463 


angiopoielin 1 


410064 


X53416 


Hs.195464 


filamin A, alpha (actin-binding prolein- 


458218 


AI435179 


Hs.126820 


ESTs 


443114 


AI033377 


Hs.153298 


ESTs 


427788 


AA412397 


Hs.1 16B5B 


ESTs 


435913 


W95006 


Hs.269559 


ESTs, Weakly similar to S65657 aIpha-1C- 


457949 


W69171 


Hs.71741 


ESTs, Highly similar to 138945 melanoma 


419203 


AA488719 


Hs.190151 


ESTs 


412510 


AI056689 


Hs.133538 


ESTs, WeaWy similar to ALU1.HUMAN ALU S 


413885 
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442180 AA983913 Hs.128929 ESTs 

434256 AI378817 Hs.191847 ESTs 

15 444519 AI160304 Hs.28313 ESTs 
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4211B6 AI798039 Hs.270563 ESTs, Moderately similar to T12512 hypot 

426435 AI827946 Hs.1 24854 hypothetical brain protein my040 

439312 AA833902 Hs.270745 ESTs 
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402294 
417759 
417527 
427526 
455300 
448121 
415855 
425702 
441056 
400311 
451478 
425288 
456397 
405654 
450151 
419851 
406016 
440903 
445026 
414182 
457048 
440542 
422857 
445948 
454002 
413656 
420441 
412062 
408991 
432534 
435136 
451052 
413928 
439448 
403344 
418056 
435428 
419964 
440926 
452625 
452797 
436120 
449567 
409628 
416617 
452266 
404606 
401814 
428403 
433390 
451443 
411188 
452704 
424060 
433331 
428520 
439492 
426736 
416225 
404917 
448955 
402797 
457951 



R13567 

AA203524 

AA405062 

AW891707 

AL045714 

A/921875 

N59555 

H37860 

AF072164 

NMJM2331 

AA354502 

W28339 

A!088196 
AA287987 

A1468079 

W90337 

M136301 

AA400352 

AA889143 

R71461 

AW444S62 



T91703 

AI986160 

H09124 

BE501816 

AW361626 

R27299 

AA281504 

AA442498 

AA970788 

AA524886 

AI791746 

AA811657 

AW196772 

AA724771 

AI369787 

AI246193 

AI990790 

AB021865 

H69311 

AI767250 



AI393048 

AA586950 

AW295527 

BE161168 

AA027823 

X92108 

AI738815 

AA331901 

AF086310 

M431615 

AA577730 



AW207597 Hs.28102 



U23860 
AA149707 



Target Exon 
Hs.12548 ESTs 

gb:zx56e10.r1 So ares fetatJvBr_spleen_ 
Hs.345830 gbzul 2e04.M Scares lestis.NHT Homo sap 

gb:CM3-NT009(WM0500-171-e02 NT0090 Homo 
Hs.128653 hypolhetical protein DKF2p564F013 

gb.-wp07e04j(! t*aSGAPJGstt2 Homo sapien 

gb:yv76f05.s1 Soares fetal liver spleen 
Hs.125720 ESTs 

Hs.137570 Homo sapiens HSFE-1 mRNA, partial cds 
Hs.26456 methionine sulfoxide reductase A 

gb:EST62799 Jurkat T-ceHs V Homo sapfen 
Hs.150580 PTD010 protein 

C12001521:gi|7513934|pirjJT31081 cca3pr 
Hs.22968 Homo sapiens done IMAGE:451939, mRNA se 
Hs. 13477 ESTs, Weakly similar to 120728SA reverse 

Target Exon 
Hs.126623 ESTs 

Hs.282966 ESTs, Moderately similar to 210926QA B c 
Hs.344442 WAA1 105 protein 
Hs.112861 ESTs 

Hs.295655 ESTs, Weakly similar to PC4259 ferritin 

gb:y»51h07.r1 Soares placenia Nb2HP Homo 
Hs.202247 ESTs 

Hs.271749 ESTs, Moderately similar to ALU8.HUMAN A 

gb:ye20g09.s1 Stratagene rung (937210) H 
Hs. 180383 dual specificity phosphatase 6 
Hs.202341 Homo sapiens cDNA: FU 23573 fis. clone L 
Hs.281927 ESTs 

Hs.339833 hypothetical protein FU11240 
Hs.10172 ESTs 

Hs.24444 Homo sapiens cDNA: RJ22165 fis, clone H 

Hs.6700 ESTs, Moderately similar to Z195_HUMAN 2 

Hs.257586 ESTs 

NM_000341:Homo sapiens solute carrier fa 
gb:nh34f0Zs1 NCI_CGAP_Pr3 Homo sapiens 

Hs.130293 ESTs 

Hs.220913 ESTs 

Hs.131323 ESTs 

Hs.61425 ESTs 

Hs.7146 ESTs 

Hs.119860 ESTs 

Hs.188614 ESTs 

Hs.55276 potassium voltage-g ated channel, Shal-re 
Hs.205980 ESTs 
Hs.1 65240 ESTs 

Target Exon 

Target Exon 

Hs.3261 59 leucine rich repeat {in Fill) interacts 
Hs.260180 Homo sapiens mRNA: cDNA DKFZp761G18121 ( 
Hs.210303 ESTs 

gb:PMO-HT0425-17010(M>02-a10 HT0425 Homo 
Hs.149424 Homo sapiens PNAS-130 mRNA, complete cds 

H^apiens mRNA for subteiomeric repeat s 
Hs.1 17323 ESTs 

Hs.184736 hypothetical protein FLJ10097 
Hs.103159 ESTs 
Hs.130722 ESTs 

Hs.188684 ESTs, Weakly similar to PC4259 ferritin 
Target Exon 
ESTs 

Target Exon 
gb:Human done mcag19 chromosome 16 CTG 
Hs.173091 ubiquitirvliXo 3 
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Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

408139 10421 1 AA451966NM 01 6370 AB036693AL1 39228 R581 24 AI634847AL11 9333 W07356A1334284 H29050AI192685AA652438 AW1 72843 W19794 

N21460 AI743862 AW130522 AA991348 Al 204 553 AA992664 N80848 AA699329 AI824676 R26624 R49653 

408255 1049351 1 AW807321 AW807262 AW177104 AW807319 AW8071 15 AW807344 AW807324 AW1781 16 BE141575 AW845849 AW807105 AW845868 
BE140942 AW807178 AW8071 67 AW807398 AW807320 AW807308 AW84S866 
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PC17US02/21338 



408283 1050275.1 



409126 
409282 
409291 
409367 
409688 
409692 
409894 
410154 
410357 
410642 
410725 
410730 
410744 



410956 
411008 
411026 
411141 
411184 
411188 
411245 
411347 
411432 

411496 
411567 
411590 
411608 
411652 
411880 
411918 
411920 



412085 
412252 
412274 
412298 
412302 
412679 
412902 

413035 
413068 
413196 
413346 
413444 
413489 
413593 
413645 
413656 
413750 
413885 
413888 
414051 
414623 
415054 
415110 
415157 
415361 
415378 
415449 
415770 
415806 
415855 
415989 
416311 
416463 
417086 
417117 
417324 



1085657.1 

110159.1 

111512.1 

1115745.1 

1123651J 

114831.3 

114869.1 

1157906.1 

117950.1 

1197159.1 

1213736.1 

1218207.1 

121847J 

1219485.1 



410840 1223800.1 



1227882_1 
1229027.1 
1229373.-1 
1233793.1 
1234977.1 
1235093.1 
1236412J 
1239834 J 
1245636.1 

1248073.1 
1249774.1 
125064 J 
1251259.1 
1252836.1 
1263110.1 
1265807.1 
1265812.1 



1276467.1 
1285293.1 
128647.1 
1288098.1 
1288128.1 
1321229.1 
1335166.1 

1346295.1 

1348104J 

135322.1 

136323.1 

1370831.1 

1373392 1 

137891.1 

138145.1 

1381628.1 

1386250.1 

1397288.1 

139750.1 

1413185.1 

1466952.1 

151827.1 

1522905.1 

1525616 1 

1534970.1 

1535274.1 

1537026.1 

155437.1 

155668.1 

155930.1 

156454.1 

158797.1 

1595999.1 

164830.1 

1651230 1 

166714.1 



BE141 579 AW807555 AW807502 BE141596 AW845B45 AW807500 AW845864 AW8O74B0 AW807486 AW807478 AW17B109 AWB07228 
AW807374 AW807125 AW846124 AW807470 AW807477 AW807510 AW807208 BE141573 AW807465 
AW277084 R27662 R26970 0791 94 

AA063426 AW962323 AW408063 AA063503 AA772927 AW753492 BE175371 AA31 1 147 

AW966480 AA069840 AA384646 

AW373472 AW373484 BE071899 BE071898 

AW382767 BE153835 BE153702 BE153572 

All 50485 AW938392 AA076894 AW883422 

A1500724 AA399661 AA397891 AI471084 A142351 1 T07531 AI094336 

BE081731 AW861738 AW503629 BE081969 

F06959Z43559AA082C02 

AW663614 BE046540 BE045760 AW827443 BE046544 
AW792784Z44444H06639 

AW799279 AW799395 AW799392 AW799276 AW799479 H57885 

AW368860 AA457091 AI903441 AA088B23 W88852 AW979154 AA826016 R94779 

H86002 W92289 AW801558 AW801324 AW801270 AW801307 AW801351 AW801357 AW801299 AW801609 AW801356 AW801420 
AW801425 AW80135B AW801429 AW801428 AW801427 AWB01424 AW801306 

AW806924 AW865537 AW856473 AW865298 AW866390 AW866478 AW866454 AW866309 AW866539 AW866521 AW866547 AW865517 
AW866403AW866369 

AW938322 AW93B307 AW938320 AW938323 AW81 1840 

AV^13238AW813474AW813334AVW16081 AVV813296AW813363AW813397AW813327AW81^ 
AW813786 

AW819561AW819682AW819563AW819688AW819499AW819498AW819690BE065081 
AW821 1 17 AW855541 AWB55405 AW855374 
BE161 168 BE1 62466 AW821 260 

AW833441 AW833552 AW83370O AW833610 AW833673 AW833675 

AW8381 26 AW838294 AW838247 AW838251 AW838292 AW838299 AW838374 

AW846272 AW846564 AW846545 AW846285 AW846135 AW846317 AW846200 AW846265 AWB46326 AW846196 AW846357 AW846153 
AW846286 AWB46319 AW846277 AW846381 AW846438 AWB46481 AW846352 
AW849241 AW849569 AW849243 

AW851630 AW851703 AW851735 AW851723 AW851708 AW851712 
T96183T64O70AA094134 

AWB53441 8E145228 BE145218 BE145162 BE145283 
AW855393 AW855560 AW855424 
AW872477 BE088101 T05990 

AW876354 AW876179 AW876318 AW876290 AW876234 AW876125 AW876199 AW876198 

AW876263 AW876257 AW876261 AW876273 AW876231 AW876398 AW876334 AW876134 AW876371 AW876308 AW876314 AW876328 
AW876169 AW876331 AW876426 AW876407 AW876412 AW876322 AW876363 AW876218 AW876240 AW876141 AW876138 AW876326 
AW876181 AW876131 AW876378AW876206 AW876191 AW876188AW876376 AW876289 AW876415AW876209 AW876366 AW876343 
AW876347AW876165 ' 
AWB91667H93096 

AW903782 AW903668 AW903672 AW903763 AW903784 
AA101443 R20332 F07484 
AW936300 AW936538 AW936386 
AW936334 AW936371 AW936474 
BE144762AW979091 

BE008018 BE008025 BE008026 BE007959 BE007994 BE008016 BE008019 BE008024 BE008022 BE008027 BE008029 BE008020 BE008015 
BE008021 BE008028BE008023BE008030BE008014 
BE155563 BE 155574 BE155556 BE061294 

BE063792 BE053803 BE063775 BE063801 BE063809 BE063777 BE063784 BE063808 BE066125 BE063782 BE063805 BE063812 

AA127386 R15644 AA127404 

AA1 28586 AA463667 AA235203 

BE141019BE141473BE141529 

BE144228 BE144291 

AA205248 AA130658 AA204737 

AA130992 AA503835 AW969537 

T91703 BE155222 BE155240 BE155274 BE155275 T92469 

BE161453 W28808 

BE177442 BE177439 BE177445 BE177440 BE177448 BE177444 BE177433 

AA580268 AA31 5655 M1 33031 AA377748 

BE244127BE246216 

BE391050BE389191 8E389697 

AI733907AA159708AI732614 

H04043 060988 D60337 

063257 078918 D632 14 

F06724 Z43690 R21681 

T16964 F07075H10256 

H15034T17195F09069 

M79237AI267298AA169260 

AA169560AA169840 

AI921875AA170835AA866613 

AI267700 AI720344 AA191424 AI023543 A1469633 AA1 72056 AW958465 AA172236 AW953397 AA355086 
D80529 D81719 C14833 AA1 79446 AA357794 
H59241 K57540 R00795 

AA 194446 AA1 94603 AA1 93162 AA 196396 Z24810 
N46778 W86339 N75221 

AW265494 AA455904 AA1 95677 AW265432 AW991605 AA456370 
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417527 
417670 
417882 
417890 
418056 
418797 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



419145 
419386 
419501 
421338 
421582 



421718 
421926 
422259 
422297 
422337 
422444 
422857 
422936 
422940 
422959 



423074 
423622 
423736 
423977 
424060 

424160 
424290 
424951 
425288 
425303 
425702 
425724 
425757 
425B93 
425980 
426123 
426477 
428036 



168576.1 

1692163.1 

1705458.1 

1706141J 

171841J 

179095.1 

179717.1 

182217.1 

184356.1 

185354.1 

201378.1 

2041.1 



20595.1 

209246 1 

214322.1 

214851.1 

215447.1 

216595.1 

222190.1 

223099.1 

223106.1 

22327.3 



22470.1 
230333.1 
23151 1J 
23386BJ 
23490.1 

236151.1 
237857.1 
245273.1 
249569.1 
249780.1 
255206.1 
255509.1 
255956.1 
257742.1 
258778.1 
261304.1 
267804.1 
J 



AA203524 W88451 
R07785 T85948 T86972 
R22311R23982 R23997 
R79048 R23111 

AA524886 AW971347 AA21 1 537 
M515814 AA515037 AA230024 AA228343 
AA229558AA345492AA229582 

N93S38 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R9481 5 
AA236867 AA237066 AA354236 AW957759 H08961 
AW843822 AW844020 AAB25923 AW968657 AW854173 BE084673 
AA287443 AA419385 BE084078 A1478347 

AI910275 X00474 X52003 X05030 NM.003225 AA314326 AA308400 AA506787 AA314825 AI571948 AA507595 AA614579 AA5B7613 R8381B 

AA568312 AA614409 AA307578 AI925552 AW950155 AI910083 M12075 BE074052 AW004668 AA578674 AA5B2084 BE074053 BE074126 

BE074140AA514776AA588034BEQ74051 BE074058AW009769 AW050690AA858276 R55389 AI001051 AW050700AW750216 AA614539 

BE074045 AI307407 AW602303 BE073575 AI202532 AA524242 AI970839 AI909751 BE076078 AI909749 R55292 

AL1 17574 AI681945 BE503055 AW015330 C16652 AA441855 AA329853 AW962502 AA441830 AF1 14049 

AA300591 AW963893 AA300493 

AA307584AW795791 AW795790 

AW961290 AA308338 R60841 BE378716 R13001 

R38572 R49597 H51730 F10468 AA309198 BE011B89 AA613236 

AA310688 AA355321 AW962134 

R71461 AA328744 AA318128 BE081817 BE081587 BE081751 
AA319278 AA319712 AW961 504 AW954956 
8E077458 AA337277 AA319285 

AV647015 AV6471 62 AA477047 AW392066 BE1 68052 AA657684 AL040747 AL043589 AL04161 8 W24630 AA41 9002 AA778650 AI028269 
AW378085AW780241 AI984255AW129462AI271476 AL040967 AA836991 AL040910 AW273346 AA1 32452 A1375545 A1436321 N53166 
AA598667 AA423804 AA889495 AI954063 AI923968 AW194576 AI400578 AA748499 AA860230 AW519209 AA339537 AA476935 AA767391 
A1017695 A1860419 N93437 AW452389 AI051533 AA806940 AA609569AI 624860 A1356952 A181 1516 A1439198 AA749268 AI206880 
AI784422 AI431 957 All 87038 AI678429 AI273421 AI358604 AI691155 AA586499 AA897667 AA524576 AJ417772 AW799936 BE241923 
AA013428 BE164987 AW602710 AI933540 AA757772 AW015132 AI954159 AW956377 W69468 AI807580 AI871746 AI922524 N80292 
A1676213 BE487086 N92405 AV65631 1 AI678172 AW339455 AI949278 AIB72882 AI1237B5 AA01 6267 A196 1623 AW1 6851 9 F28686 F37181 
AA908333 AW0207B8 AW374027 

AL109963 AA134692 BE273642 M78295 AI825179 AA228029 BE538677 AW589948 AW241564 AA732224 AA228030 AW977225 

BE154647BE154890AA328702 

AW936874 AW936923 AW936924 AA330274 

AA333232AW962695R54896 

X92108 AW295478 A1768675 AI086644 AW1 90160 AA31 3783 AA322741 AA354698 AW962099 AL044667 AA282648 AA973897 AL036586 

AA386055 AL135179 AW882199 

T74062 F12441 AA336207 BE185031 

AA338396 AW956247 AW753612 

AW964082AA348838AA348839 

AA354502 AA355215 AW962122 

AA354785 AA354792 AW958427 

N59555AA362113AA362136 

AA362525 AW979199 AA857501 

AA363171 AW963347 AA371863 

AA629695 AA365582 AA365581 AW959799 AW876559 AW876588 
AA366951 AA470999AA469425 
AA370352 AA370860 AW962776 
AA379464AA379611AA379463 

AW068302 AI754558 AI750727 AI752631 AA3021 74 AA327522 M641 10 AW859944 AW859989 AI751995 AA769820 AI858829 AI924875 
AI888836AA864291 Al 685060 AW088029 AI924908AW466328 A1093800 AA991651 AI254501 BE004703 AA334442AW938852 AA 194330 
AL046953 AA852866 AW391995 W30846 AW662928 W25261 AA042863 R99045 H97060 W03910 H94687 T88984 AL048165 T29632 N31556 
N36484 AI798679 AA989355 W23832 AA873789 A1743646 AA363587 AI814748 AW338990 N73740 N83666 AL047816 R24137 R63433 
AA524984 AA234043 AA1 95131 N99903 AA453669 AI240302 AA370271 AI950026 AW771049 AA1 21476 AA569557 AI752632 AI355594 
AI471993 A1159941 N94555 AI753138 N21 537 H97881 N25769 AW06B044 AA808425 R63380 AA384736 AA384738 AA852352 AI073645 
AA527960 AA525036 AA044414 A1752460 AA703064 R01216 AA897183 AI751996 TB1078 H95047 AA573642 058348 N20953 AA437143 
N95439 AA579540 AW867056 AA770090 AI085180 AI806799 AA426421 A1572513 R24081 AA8531 89 AA295620 AA234044 AA371020 
AW994984 H20896 AW964438 AA3 185 16 AA31 8499 AA31 8727 AA31B21 1 AA318478 AA318444 AA318307 AA318497 AA318448 AA31B309 
AA318496 AA318213 AA318435 AA31 8424 AA31 8217 AA31 8523 AA3 1 8438 AA31 8487 AA318724 AA593185 AW994985 T69842 AJ251B13 
AA478174 AA447737 T68350 F07712 AA1 21 145 H08973 AA345212 BE000667 AW06821 0 AW608407 R05674 H16712 N85426 N42354 
H85516 BE147991 T281 13 R32662 AA384678 AW239275 H82362 AW84O700 D58229 C04082 W45394 AW795667 R73973 BE002409 
AA042828 AA363555 AJ223B12 AA344709 BE149590 R70995 W46881 W90778 N71 242 AA534826 AL040676 R23797 H96450 AA062957 
D79947 W46960 AW959278 AA295997 AA02621 5 AW579469 AW365135 AW365134 AW994353 AW972886 AW069166 AA343690 AW888731 
AI751527 AA937490 AA937505 A1826715 BE465604 AI925532 AI858109 AW339097 A1858524 AI720571 BE046506 AW384981 AA043908 
AA375983 AA525181 AW058366 AW070577 AW891837 N83985AW1 82753 A1422979 AI679733 BE006555 AL048166 AI081401 AI888821 
A1626043 N37087 A1624140 AI801298 AA600048 A1753947 H8961 5 N66424 AW069756 A1814880 AI982806 AI754287 A1971 816 AW069022 
AW069069AW069454AA342989AI077712A1311467AI087361 AI801015W46993AI281324AW191963AI421675AJ300881 A1356670 
AA873156 AI0042 19 All 89685 AA47B018 AA076063 AI445222 A1753124 AI521569 AI925026 AI022368 A1475993 H20846 AI223234 AI635123 
AA579170 N30442 AW1 17889 AA807935 AA558975 A1306636 AA888963 AI952591 AI935835 AI445293 H1 671 3 AW139833 AA6221 22 
AA972051 AI280828 H09543 AI453725 AW06961 3 AJ86561 5 AI753921 AI3S87B2 AI633208 AI446651 W46961 N22201 H82276 C16555 
AA291477 AW440535 AW517755 AAS69921 AJ926777 AW6621 18 AA55 

BE045090 AI273006 C16390 C16503 AI620823 F1 3661 N66864 Z2131 1 C16108 C16089 C1 6400 AA758273 AI287781 AA864676 AW608074 
AW3855B3 AI589944 AA665817 AW192979 AW469065 AA564048 H84715 C16417 AA731072 AA661674 C16487 N29477 AW189997 
AI370492 C16471 AA652B09 AA936687 AA50651 2 C16306 AW028413 A1537935 AA528347 C16255 AW029046 C16202 AI868152 AI524662 
T94414 AI567041 AI619654 AW008486 AI075624 AA577434 AA345104 T30105 AA932002 C16585 AI750390 AW294265 AI619552 AA669781 
AA026678 AW1 32002 AW26391 9 C16562 AA759137 AA693351 Z40779 C1 6577 AA835045 AW073763 R45484 A1520895 U5470B T49285 
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428518 
429321 
429468 
430068 
430439 
431089 
431322 
431926 
432222 
432407 



432586 
433010 
433436 
434414 



434830 
435073 
436608 
437046 
437272 
437908 
438089 
438091 



438535 
438813 
439590 
439848 
440612 
440642 
441848 
443100 

443197 
443357 
444163 
444298 
444312 
444872 
44546B 
445555 
445689 
446780 
447008 



450717 
450724 
450735 
451724 
452565 
453577 
453823 
453826 
454457 
454523 
454609 
454629 
454631 
454639 
454662 
454701 
454721 



292383.1 

302955.1 

30498J 

312849.1 

31808J 

327825.1 

331543.1 

339082J 

343347.1 

34624.1 



350432.1 
357372J 
356107.1 
38585.1 



39406.1 

399701.1 

42361.3 

43210J 

435447.1 

445001.1 

44963.1 

44964.1 



45946.1 

46553.-1 

47413.1 

477806.1 

49847.1 

499098.1 

527227.1 

559752J 

56261J 

567506.1 

593658.1 

600160.1 

600647.1 

623948.1 

640681.1 

64334.1 

647829.1 

692897.1 

70358.1 

765247.1 

844561.1 

844585.1 

844617.1 

882130.1 

92237^.1 

972216.1 

982526.1 

982669.1 

1207274.1 

1221564.1 

1226517.1 

1227240.1 

1227443.1 

1227728.1 

1228537 1 

1229702.1 

1230747.1 



AI568126 AW006569 AI093317 AL1 19781 T61046 A1053563 H51958 AF1 14144 AA305739 AW950394 AW793928 AW793910 AL047737 

AV659047 AV659632 AI750389 AA092053 AA092798 H85367 T61597 R23745 Z20418 T78485 AI751 52B AW05B121 AA8531BB AI752459 

AA853711 AW950663 R78964 R36359 R21626 R21522 

AW969656 AA501412 AA905186 AA429703 AA431958 

AA449921 AA449922 C75309 

AF033579NMJX>5995 

AA464964 M854Q5 AA947566 

AL133561 AL041090 AL1174B1 AL1 22069 AW439292 AI968826 

BE041395AA491826AA621946AA715980AA666102 

AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

AW972724 AA522631 AAB77998 

A1204995 AW827539 AW969908 AW440776 AA528756 

AA221036 RB7170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA66B403 AA501627 AW869639 
BE061833 BE000620 AW961 170 AW847519 AA308542 AW821 833 AW945688 C04699 AA205504 AA377241 AW821667 AA055720 
AW8179B1 AW856468AA155719AA179928 T03007AW754298AA227407 AA113928 AA307904 C16859 
AA568548 AI050036 AA554053 AI91 1659 A1826259 
AW970018 AA573889 AA573622 R08736 
AW162474 AA588442 AI972440 

AI798376S46400AW811617AW811616W0O557BE142245AW858232AW861851 AW85B362AA232351 AA218567 AA055556 AW858231 

AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA 173345 AA199942 AA223384 AA227092 AA227080 T1 2379 AA092174 

T61 139 AA149776 M699829 AW879188 AW813567 AW813538 AI267168 AA157718 AA 1577 1 9 AA 100472 AA100774 AA130755 AA157705 

AA1 57730 AA 1577 15 AA053524 AW84958 1 AW854566 C05254AW882836 T92637AW8 12621 AA206583 AA209204 BE1 56909 AA226824 

AI829309 AW991 957 N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 

AW817705AW817703AW817659BE081531 H59570 

AW852235AF156166R07008 

AA664078 AW363313 AA805009 

AA628980 A1126603 BE504035 

BE149154 BE149151 X60138 

AW975957 M747943 AA81 1289 

AI082424 AI740586 AA771764AA771B05 AI033879 BE500996 AW204531 038676 
W05391 X56197 H75313 AW770789 AA281642 AV654440 AA344646 R00244 T83378 

AW373062 T55662 A1299190 BE 174210 AW579001 H01811 W40186 R67100 AI923886 AW952164 AA628440 AW898607 AW898616 
AA709126 AW898628 AW898544 AA947932 AW898625 AW898622 AI276 125 Al 185720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 Al 288245 Al 186932 D52654 D55017 052715 D52477 D53933 D54679 AI298739 AI1 46984 A1922204 N98343 BE174213 AA845571 
A1813854 AI214518 AI635262 AI139455 AI707807 AI698085 AW884528 AI024768 AI004723 AW087420 A1565133 N94954 AI268939 
AW513280 AI061 126 AI435818 AI85910B A1360506 AI024767 AA513019 AA757598 X56196 AA902959 AI334784 A1860794 AA010207 
AW890091 AW513771 AI951391 A1337671 T52499 AA890205 AI6409O8 H75966AA463487 AA358688 AJ961767 AI865295 AA780994 
AI985913 BE174196 AA029094 AW592159 T55581 N79072 A161 1201 AA910812 AI220713 AW149306 AI75841 2 AA045713 R79750 N76096 
L09078 L03145 L09094 L09098 L03165 L09102 
M27346 

AF086410W94386 W74609 
AW979249 063277 AAB46968 
BE561384AW732707 

AI744995AI973099 R19291 AA935215 AA897534 AI638070 BE179102 AI694912 
AWB50980 AA969613 AW366793 

AI033188 BE004743 AW804074 BE004795 BE178660 BE089438 BE089378 BE0B9439 BE089437 BE089423 BE089420 BE089425 BE089424 
BE089426 

Z43613 AA320191 AW954963 H15889 R83872 

AW016773 A1052778 AI452937 AW085293 

A1126098AM84746AI148521 

Z17870 AL039867 AI139927 

R44007 N54521 A1140476 AW418649 T95308 

A1936264 A1219228 AI20D740 BE300771 AI635739 Al 690626 AW044010 R60842 R40388 
AW450439 AW297340 A1239849 AI6131 1 9 
AW97401 3 AA557257 N36812 A1597998 AA873565 
BE158869 A1248241 BE158865 BE158868 
R31 107 AI341 136 AI653198 H04953 

BE01 01 89 AW879041 BE008038 AW905325 AA343575 AW844209 AL038020 BE0101 33 BE010083 AW903901 BE174377 AI309717 A1349651 

A1903726 BE176661 AW905394 

AI523875 R45782 R45781 

T94709 A1820675 AI732253 

R55428 AI820704 AI732283 R54983 

AI732321 R55640 R55639 A1820744 

AI903765 AI81 1 194 BE007147 AW130760 

BE066552 BE066341 A1907683 

A1043049 T95976 

AL137967BE064160BE064186 

AL138129 AL138179 BE064231 

AW753456 AW753036 AW854868 AW854862 

AW803980AW803974 

AW810204 AW810555 AW810196 AW810619 AW810507 

AW81 1 1 14 AW81 1095 AW81 1087 AW81 1 124 AW81 1054 AW81 1094 AW81 1 157 

AW811324AW811325AV^11326AVV811333AW811329AW811328AW811332AW811339AVV811335 

AW81 1 633 AW81 1652 AW81 1898 

AW812715 AW812646 AW812714 

AW854930 AW814431 AW814190AW814115 AW854941 

AVW15588 AW815671 AW815493 AVTC15898 AW815613 AVM^ 
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454741 1232559 1 BE 154396 AW8 17959 BE 154393 

454826 1 236377~1 AW833676 AW83381 4 AW833798 AW833677 AW833449 AW833630 AW833826 AW833444 AW833366 AW833791 AW833S59 AW833432 

AW833534 AW833556 AW833553 

454915 1242343 1 AW841619AW851958AW851851 AW851985 

455065 1251980 1 AW854352 AW85431 1 AW854340 AW854461 

455087 1252832*1 AW855389AW855556AW855420 

455116 1254206 1 AW857271 AW857308 AW857296 AW857258 

455238 1 265662J AW875972 AW875983 AW875974 AW876000 AW875966 AW876050 

455300 1276482J AW891707A 

455407 1288347 1 AW938813AW936731 AW936728AW936600AW936681 AW936651 

455508 1318507 1 AW976165 C04000 

455527 1322125 1 AW984479 AW9S4498 AWS84495 AW984477 AW984480 AW984504 

455642 1 348163" 1 BE063965 BE063968 BE064034 BE064028 BE063874 BE053966 BE063869 BE064043 BE064033 BE063884 

455649 1348708*1 BE065051 BE155165 BE064764 BE155231 BE084848 BE064671 BE064636 

455753 1356070*1 BE075124 BE075229 BE075278 

455757 1358657*1 BE079531 BE079371 BE079372 BE079593 BE079468 BE079504 BE078505 

455908 1382301 1 BE 156306 BE 156188 BE 156298 BE 156377 BE 156374 nc «™ 

455917 1382784*1 BE156765 BE156770 BE156767 BE156769 BE156803 BE156802 BE156847 8E156853 BE156780 BE156836 BE156792 BE156834 BE156779 

BE156789 BE156833 BE156844 BE156B31 BE156849 BE156797 BE1567B4 BE156801 BE156843 BE156793 BE156852 

455988 1397740 1 BE 177983 BE 178322 

456034 142696 1 AW450979 AA1 36653 AA 136656 AW41 9381 AA984358 AA492073 BE1 68945 AA809054 AW238038 BE01 1212 BE01 1359 BE011367 

BE011368 BE011362 BE011215 BE011365 BE01 1363 

456172 1603643 1 R99050 R99067 H67642 

456332 179104 1 AA228357 AW841786 AW841716 

456381 184123 1 AA236606 AA459341 AA237079 

457297 313764 1 AW968188 AA468196 AA468269 AA468298 

457389 331080 1 AW970989 AA502167 AA507546 ..^..^Aumflnn 

457567 357346 J AW939074 AW939073 BE1 60476 AW939938 AW939206 AW94001 2 AW939076 AA573577 AW750479 AA574383 AW970057 

457581 359936 1 AA57851 2 AA595535 BE 1 77533 _ 

457741 395767"l BE044740 AWB27360 AW827623 BE161439 BE044718 BE046207 BE046551 M653908 BE1665B1 

457871 426637*1 AI168278 AA868238 BE550792 A1522194 A1819707 AA973538 AI990086 AI628424 AI095270 A1991608 AA730741 

457951 44251 T U23860 U80739 

459234 945240_-1 AI940425 



TABLE 8C 



Unique number corresponding to an Eos probeset . 
Sequence source. The 7 digit numbers tn this column are Genbank Identifier (GQ numbers. 'Dunham I. et aL" refers to the publication entitled The DNA 
sequence of human chromosome 22." Dunham L et al, Nature (1999) 402:489495. 
Indicates DNA strand from which exons were predicted. 



Pkey: 
Ref: 

Strand: 

NLpositlon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400704 


8116864 


Minus 


63110-63241 


400830 


6570385 


Plus 


157683-163035 


400834 


8705192 


Plus 


121963-122288 


400840 


9188586 


Plus 


113882-114121 


400850 


1927150 


Minus 


4506-4691 


400881 


2842777 


Minus 


91446-91603,92123-92265 


401016 


8117441 


Plus 


126234-126359,128050-128236 


401090 


8492704 


Minus 


201281-201460 


401215 


9858408 


Rus 


103739-103919 


401241 


4827300 


Minus 


30503-30844,31055-31248 


401335 


9884881 


Plus 


15736-16352 


401381 


8570226 


Minus 


118629-119146,119392-119657 


401400 


7708226 


Minus 


33028-33585 


401469 


6682292 


Minus 


125521-125639 


401473 


7249001 


Plus 


115142-117305 


401577 


9280797 


Minus 


139377-139674,141195-141281,142217-142340 


401658 
401659 


9100664 


Plus 


89638-90028 


7689875 


Minus 


183379-183521 


401686 


6468551 


Plus 


5005-5426,6810-7042 


401723 


7656694 


Plus 


147273-147503 


401814 


7408052 


Plus 


136003-136726 


401818 


7467933 


Minus 


10964-11084,11674-11817 


401890 


8516144 
8698760 


Plus 


148955-149396,149569-150002 


401908 


Minus 


126888-127024 


401913 


9369520 


Minus 


33753-33904 


401927 


3873185 


Minus 


112000-112137 


402)25 


7547159 


Plus 


173835-173998 


402039 


7770432 


Rus 


560-1294 


402049 


6072512 


Plus 


100065-100419 


402085 


7249154 


Rus 


90533-90687.94949-95158 


402241 


7690131 


Minus 


125073-125206,130996-131125 


402294 


2282012 


Minus 


2575-3000 


402305 


7328724 


Plus 


40832-41382 
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402366 


9454515 


Plus 


195808-196863 


402551 


9B56793 


Minus 


37346-37633 


402654 


8076879 


Plus 


4405844803 


402685 


8318555 


Plus 


58962-59294 


402762 


9230904 


Minus 


123298-124035 


402797 


3421043 


Minus 


15758-15930 


402833 


8918545 


Plus 


26987-27778 


402901 


8894222 


Minus 


175426-175667 


402948 


936845B 


Minus 


143456-143626,143808-143935 


403066 


8954202 


Plus 


158189-158433 


403072 


8954241 


Plus 


141829-142006 


403146 


9799812 


Plus 


162877-163118 


403197 


9930749 


Plus 


79990*0237 


403214 


7630945 


Minus 


76723-77027,79317-79484 


403217 


7630959 


Plus 


54089-54163,55427-55623 


403290 


8083176 


Plus 


19288-20076 


403291 


7230870 


Plus 


95177-95435 


403294 


8096496 


Plus 


41565-41881 


403315 


8247953 


Minus 


125117-125287 


403332 


8568139 


Minus 


31409-31674 


403344 


8569726 


Plus 


70823-70990 


403362 


8571772 


Plus 


64099-64260 


403371 


9087278 


Plus 


105655-106050 


403391 


9438337 


Plus 


42410-42544,8331 7-83540,86840-86922.87970-881 1 0 


403488 


9966615 


Minus 


12450-12753 


403536 


8076924 


Plus 


34972-35182 


403779 


8018040 


Minus 


95602-95969 


403859 


7708954 


Plus 


113738-113858 


403871 


7709262 


Plus 


104545-104757 


403903 


7710671 


Minus 


101165-102597 


403917 


7710849 


Plus 


109718-109847,109927-110202 


403978 


8576014 


Plus 


97326-97808 ' 


404031 


7671252 


Plus 


171477-172316 


404167 


9926594 


Minus 


77030-77280 


404220 


6706820 


Plus 


46107-46439 


404286 


2326514 


Plus 


51086-51301 


404418 


7382420 


Minus 


153339-153481.155099-155294 


404427 


7407959 


Plus 


127170-127358 


404429 


7407979 


Plus 


31352-31498 


404440 


7528051 


Plus 


80430-81581 


404495 


8151634 


Minus 


59449-60477 


404580 


6539738 


Minus 


240588-241589 


404606 


9212936 


Minus 


22310-23269 


404730 


8389582 


Plus 


119832-120016,124110-124275 


404917 


7341851 


Plus 


49330-49498 


405033 


7107731 


Minus 


142358-142546 


405137 


8570507 


Plus 


158969-159423 


405146 


9438278 


Minus 


102529-102633 


405158 


9966252 


Plus 


42873-43056,4381543949 


405187 


7229826 


Pius 


117025-117170,118567-118736 


405223 


7239614 


Plus 


106184-106313 


405340 


6094635 


Plus 


49644-49760 


405494 


8050952 


Minus 


70284-70518 


405551 


1552506 


Plus 


12525-12997 


405654 


4895155 


Minus 


53624-53759 


405667 


4726099 


Plus 


5798-5914 


405673 


4589984 


Plus 


50700-50842 


405704 


4204244 


Pius 


136842-139051 


405723 


9801668 


Plus 


114896-115831 


405727 


9838331 


Minus 


76865-79664 


405760 


6066938 


Minus 


37424-38045 


405779 


7280331 


Minus 


33048-33856 


405944 


7883702 


Minus 


5143-5684 


406002 


8247797 


Minus 


154007-154579 


406016 


8272661 


Plus 


41341-41940 


406097 


7107918 


Minus 


35698-37269 


406104 


9124028 


Plus 


35309-35977 


406156 


7144867 


Plus 


379-597 


406207 


5923650 


Minus 


162607-162800 


406300 


6479046 


Minus 


19234-19401 


405308 


9211532 


Plus 


358408-358651 


406314 


9211609 


Minus 


12899-13011,18022-18138 


406317 


9211652 


Plus 


108018-108410 


406432 


9256504 


Plus 


3804-3930,40264120,4929-5109 


406490 


7711309 


Minus 


8029540480 


406584 


3983530 


Minus 


39894497 
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TABLE 9A: Genes predictive of no bladder cancer progression ___ 

Pkey: Unique Eos prooeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelO: Unigene number 

Unigsne Title: Unigene gene lite 

R1 80th percenEe of Ta or T1 rumor Ats from patients who did not upstage divided by the 80th percentile of Te or T1 tumor Als from patients who did upstage 
R? median of Ta or Tl tumor Als from patients who did not upstage divided by the median of Ta or T1 tumor Als from patients who upstaged 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


408000 


111690 


Hs.198689 


bullous pemphigoid antigen 1 (23Q/240kO) 


4.64 


5.88 


412129 


M21984 


Ms.73454 


troponin T3, skeletal, fast 


4.54 


5.10 


459290 


NM.001546 


Hs.34853 


Inhibitor of ONA binding 4, dominant neg 


4.37 


1.63 


400844 






NM_00310S':Homo sapiens sortitirwelated 


3.69 


5.90 


419555 


AA244416 




gb:nc07d11.s1 NCI.CGAP.Prl Homo sapiens 


3.61 


Z03 


414522 


AW518944 


Hs.76325 


step II spOcing factor SLU7 


3160 


1.00 


440509 


BE410132 


Hs.134202 


ESTs, Weakly similar ta T17279 hypotheti 


3.58 


1.04 


445182 


AW189787 




ESTs 


as7 


Z70 


407151 


H25836 


Hs.301527 


ESTs, Moderately similar to unknown [H,s 


3.56 


1.48 


421314 


BE440002 


Hs.180324 


Homo sapiens, clone 1MAGE.-41B3312, mRNA, 


&28 


a25 


42S663 


M68874 


Hs.21t587 


phospholipase A2, group fVA (cytosoiic, 


3.25 


250 


430702 


U56979 


Hs.278568 


H factor 1 (complement) 


2.20 


270 


412420 


AL035668 


Hs.73853 


bone morphogenetic protein 2 


3.20 


230 


420729 


AW964897 


Hs.290825 


ESTs 


3.20 


1.53 


433376 


A1249361 


Hs.74122 


caspase 4, a poptosis -related cysteine pr 


3.00 


4.10 


420028 


AB014680 


Hs.8786 


carbohydrate (N-acely!g!ixx>sarnine-6-0) s 


294 


Z78 


407881 


AW072003 


Hs.40968 


heparan sulfate (glucosamine) 3-O-sulfol 


2.93 


1.43 


426283 


NrvL003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


2.93 


1.33 


428030 


AI915228 


Hs.11493 


Homo sapiens cDNA FU 13536 lis, clone PL 


2.92 


247 


419713 


AW968058 


Hs.92381 


nudlx {nucleoside diphosphate linked mo! 


289 


3.33 


414407 


AA147026 


Hs.76704 


ESTs 


2.87 


Z87 


450779 


AW204145 


Hs.156044 


ESTs 


2.78 


1.86 


411243 


AB039886 


Hs.69319 


CA11 


273 


1.00 


417876 


U90916 


Hs.82845 


Homo sapiens cONA: FU21930 hs, clone H 


2.68 


2.32 


441519 


NM.014056 


Hs.7917 


DKFZP564K247 protein 


2.67 


298 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osleoponfin, 


2.66 


1.00 


440006 


AK000517 


Hs.6844 


hypothetical protein FU20510 


2.64 


1.77 


426252 


BE176980 


Hs.28917 


ESTs 
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gb^}33f0ar1 Scams placenta Nb2HP Homo 
ATP synthase, H transporting, mitochondr 
RNA (guanine-7-) methylfransferase 
FEM-1 (Celegans) homotog b 
OKF2P434J214 protein 
NM_001093*:Homo sapiens acetyl-Coenzyme 
K1AA1 116 protein 
CGM48 protein 

ESTs. Weakly similar to ISHUSS protein d 
hypothetical protein DKFZp5S4K142 
transcription factor 7-Tike 2 (T-cefl sp 
amyto-1,6-g!ucosidase, 4-afpha-gfucanotr 
DKFZP564G0222 protein 
NSI-associated protein 1 
rec 

hypothetical protein MGC5370 
Eos Control 
synaptotagmin I 
Target Exon 

gb:EST386776 MAGE resequences, MAGM Homo 
gb:601118231F1 N1H.MGCJ7 Homo sapiens c 
sorcin 

E74-Hke factor 2 (ets domain transcript 
Target Exon 

hypothetical protein similar to ankyrin 
hypothetical protein AF1 40225 
Cl0000818-:gil7661B82|reflNP.055697.1l K 
Homo sapiens cDNA FU14175 fis, done NT 
ESTs 

patched related protein translocated to 
sucrinate-CoAflgase, GDP-fonming, beta 
K1AA1457 protein 

gb:RC3-BN0034-2902QO-013408 BN0034 Homo 
Homo sapiens mRNA; cDNA DKFZp434I0812 (f 
protein tyrosine phosphatase, receptor t 
NMJ)14080:Homo sapiens dual oxkJase-like 
ESTs 

hypothelicat protein FU14681 
collagen, type IV, alpha 5 (Alport syndr 
ESTs 

F-box only protein 22 
prolyl endopeptidase 
differentially expressed in hematopoiet) 
cytochrome b5 reductase 1 (B5R.1) 
hypothetical protein FU13187 
hypothetical protein 669 
tubby super-family protein 
desmocollin 3 

Homo sapiens cDNA FU14415 fis, clone HE 
transdudn (beta}-like 2 
ESTs, Weakly similar to T18967 hypothell 
Ba2/adenovirus E1B 19kD-interactir»g pro 
guanine nucleotide binding protein 10 
NM_003105*:Homo sapiens sortilirvrelated 
Homo sapiens cDNA: FU23034 fis, done L 
ESTs, Weakly similar to A46302 PTB-assoc 
laminin, beta 3 (nicein (125kD), kalinbi 
glutamate-cystelne ligase, catalytic sub 
Target Exon 

gb.7dOdO8.sl Stratagene lung (937210) H 
EST, Weakly similar to 138022 hypothetic 
ESTs 

ninein (GSK3S interacting protein} 
CEGP1 protein 
desmoglein 2 

lectin, galactoside-binding, soluble, 4 
ESTs, Weakly simitar to dJ 403A1 5.3 IH.sa 
ESTs, Weakly similar to A47582 bceSi gr 
hypothetical protein FU22604 
CCR4-NOT transcriplion complex, subunit 
three-PDZ containing protein similar to 
B-oell ClL/rymphoma 10 
Homo sapiens cDNA FU1361 8 fis, done PL 
ATP synthase, H transporting, mftochondr 
transJocase of cuter mitochondrial membr 
gb:43d9 Human retina cDNA randomly prime 
ESTs 
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Hs.1384 


O-6-melhylguanine-OMA me thy! transferase 
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9 15 


451234 


A1914901 
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ESTs, Weakly similar to 138022 hypotheti 
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2.43 


423332 


A1091466 


Hs,1 27241 


sorting nextn 7 


1.76 
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SH3-oonlaIning protein SH3GLB1 
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AJ697990 


Hs.346002 


ESTs 
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3.15 
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Hs.225967 


KIAA0995 protein 
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weel (S. pomba) homolog 
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Hs.334309 
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ESTs 
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ESTs 


1.74 


2.02 


445537 


AJ245671 


Hs.12844 
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Hs.82043 
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gb:UI-HF^R0p-ajv-oW.UI.r1 NIH_MGC_5 
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2.63 


411442 


N25956 
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Homo sapiens cDNA FU 14232 lis, done NT 


1.72 


1.68 


400846 






scrtilirwelated receptor, 1{0LR class) 
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hypothetical protein 
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hypothetical protein FU22995 
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ESTs 
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ESTs 
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troponin C slow 


1.67 
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ESTs 
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1.66 


1.97 


443558 


AA376798 


Hs^86122 


MOS024 protein 
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Hs.48938 


hypothetical protein FU21 802 
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BE295568 


Hs.13225 


428172 
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421887 


AW161450 
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418127 


BE243982 


Hs.83532 


400297 


AJ127076 
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434938 
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Hs.82932 


418067 
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Hs.83393 
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Hs.22265 


451938 


AI354355 


Hs.16697 


407325 


AA291180 
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Z44547 
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nuclear transport factor 2 (placenta) pr 
NM.003 1 05*:Homo sapiens sortilirwelated 
TP53 target gene 1 
Target Exon 

profine synthetase co-transcribed (bade 
KIAA1494 protein 

myetoioTrymphoid or mixed-lineage teuton 

ubtquitln-conjugating enzyme E2E 3 (homo 

HE4; epWidymls-specific, whey-addle pr 

contaciln 3 (plasmacytoma associated) 

hypothetical protein MGC4248 

lipoic acid synthetase 

upstream binding protein 1 (LBP-1a) 

KiAA0560gene product 

pleckstrin homology domain-containing, f 

CGI-110 protein 

ESTs, Weakly similar to S65657 aipha-lC- 
ras homolog gene family, member E 
Homo sapiens, Similar to RIKENcDNA 1810 
zinc finger protein 258 

Homo sapiens, clone MGC: 15763, mRNA, com 

postsynaptic protein CRIPT 

Wdney- and liver-specific gene 

nuclear receptor co-repressor/HDAC3 comp 

prostagiandin-endoperoxide synthase 2 (p 

ESTs 

a disintegrin and metailoproteinase doma 
solute carrier family 30 (zinc transport 
NM_022041*:Homo sapiens giant axonal neu 
vacuolar protein sorting 35 (yeast homol 
C6000994*:gi|10435784|db]|BAB14568.1| (A 
mitochondrial ribosomal protein L3 
neuronatin 

glioblastoma amplified sequence 
protein tyrosine phosphatase, receptor t 
C5001831:gi|11056014|ref|NP.06765t1| ac 
eukaryotic translation elongation factor 
Homo sapiens cDNA FU14059 fis. done HE 
Human DN A sequence from clone RP11-145L2 
ESTs 

C7000609':gi|628012|pirlJA53933 myosin I 
STAM-like protein containing SH3 and ITA 
ESTs. Weakly similar to I38022 hypotheti 
hypothetical protein FU 13224 
ubiquitln specific protease 9, Xchromos 
hypothetical protein FU12553 
Target Exon 
netrln 4 

tumor protein 63 kDa with strong homolog 
ESTs 

ESTs, Weakly similar to spiiceosomal pro 
Homo sapiens cDNA FU12961 fis, ctone NT 
hydroxyprostaglandin dehydrogenase 15-(N 
ESTs, Weakly similar to AF208855 1 BM-01 
hypothetical protein FU20607 
ESTs 

ESTs, Moderately similar to ALUC.HUMAN ! 
ESTs, Moderately similar to ALU7.HUMAN A 
filamln A, alpha (actin-binding protein- 
ESTs 
ephrin-A2 

Immunogtobuiin kappa constant 
Homo sapiens cDNA FU 11 346 fis, done PL 
hypothetical protein FU20160 
NMJ)03105\Homo sapiens sortiruwelated 
UDP-Gat:betaGlcNAc beta 1.4- gatactosylt 
zinc finger protein 136 (done pHZ-20) 
CQ-85 protein 

membrane cofactor protein (CD46, trophob 
hypothetical protein DKFZp5640l278 
Homo sapiens, clone MGC:16169, mRNA, com 
cydin 01 (PRAD1: parathyroid adenomatos 
cystatinE/M 

pyruvate dehydrogenase phosphatase 
down-regulator of transcription 1, TBP-b 
ESTs, Weakly similar to alternatively sp 
ESTs, Moderately simitar to 138022 hypot 

203 



1.68 


2.88 


1.66 


1.61 


1.66 


2.63 


1.66 


2.34 


1.65 


2.08 


1.65 


3.13 


1.65 


1.37 


1.65 


2.35 


1.65 


1.00 


1.65 


1.44 


1.65 


235 


1.65 


200 


1.65 


2.33 


1.65 


213 


1.65 


1.63 


1.64 


1.28 


1.64 


1.60 


1.64 


1.91 


1.64 


2.04 


1.64 


1.53 


1.64 


2.05 


1,64 


230 


1.64 


2.05 


1.64 


2.26 


1.64 


210 


1.64 


Z05 


1.64 


1.00 


1.64 


2.16 


1.64 


2.73 


1.64 


2.75 


1.63 


2.04 


1.63 


2.64 


1.63 


0.96 


1.63 


3.45 


1.63 


1.39 


1.62 


2.11 


1.62 


2.63 


1.62 


2.19 


1.62 


3.03 


1.62 


279 


1.62 


2.55 


1.62 


2.48 


1.62 


230 


1.62 


2.17 


1.62 


2.03 


1.62 


2.25 


1.62 


2.15 


1.62 


1.00 


1.62 


1.74 


1.61 


2.16 


1.61 


2.74 


1.61 


2.59 


1.61 


1.78 


1.61 


2.10 


1.61 


1.75 


1.61 


2.41 


1.60 


3.05 


1.60 


2.63 


1.60 


2.43 


1.60 


2.69 


1.60 


1.49 


1.60 


1.08 


1.60 


1.47 


1.60 


O QQ 


1.60 


2.57 


1.60 


2.03 


1.60 


2.68 


1.59 


1.39 


1.59 


1.67 


1.59 


2.19 


1.59 


226 


1.59 


1.76 


1.59 


1.26 


1.59 


2.25 


1.59 


210 


1.58 


2.43 


1.58 


1.26 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



417343 


AA197132 


Hs 231581 


rnvrviln hftaw oohnfiniida 1 Kltnlotsl mil 
lujuduk, Heavy pwTpcpuuo i| onowuii inu 


1.58 


9 DA 


416643 


U62531 


Hs.79410 


sotuts csnfer fsmOy 4, snlon exchanger 


1.58 


I. to 


400847 






MM 0031 05* 'Homo fianiftnc cnrtinn-rplntpfi 


1.58 


l.**0 


436760 


AW606927 


Hs.5306 


hypolhefica! protein OKFZp586F1 1 22 slmD 


1.57 


1 57 


433427 


AI816449 


Hs. 171889 


ctoCn8phosphol/3nsfex3S8 1 


1.57 


164 


451986 


BE246996 


Hs.318401 


hypo Ihe tic ai protein DKFZp564D1378 


1.57 


1 R3 
1.03 


428901 


A1929568 


Hs. 145668 


KIAA1253 nmlffin 


1.57 


9 


426028 


NM-001110 


Hs. 172028 


a distnteorln and mptaDnnmfplnpcp rlnmn 


1.57 


^07 


444604 


AW327695 


Hs. 11441 


rhmmrKrtmft 1 rtnpn rpadlnn framo fi 
w u m i now i id i upcii i coumy iicauc 0 


1.57 


1 RR 
l.OQ 


439666 


W40445 


Hs 935857 


PSTs Wesklv similar \a I3AD99 hvmlheli 


1.57 


q ivy 
O.w 


426996 


AW968934 


Hs 173108 

iwi nw ivo 


Homo saoians cDMA: FL191RQ7 ft? dnna H 


1.57 


9 01 


447343 


AA256641 


Hs 236894 


ESTs Hiohlv similar fo a!nha-9^n 


1.57 


9 R.1 


418942 


A) 566004 


Hs U125Q 


nulliU ba^Hclu IrUlvv rUi lOOV ltd, WOtUJ V 


1 57 


1 91 


416555 


AI417215 




hvnn(hf>(!ral nmtnin Fi M9177 
nypuu iuuvcu piuioui rui ioji i 


1.56 


3,08 








MM D91 1 55**Mnmn canipne m9AQ flnKnpn tC 
riPHijjc i (33 .norno sapiens KjUcwj anuyen \\i 


1 56 


9 


419749 


X73808 


Hs.93029 


crtarr/mtpnnpHtn cwrv flnrl k^aUikA H 


1.56 


2.08 


404977 






IniiuDnJ/Ta? omwih factor 2 /eftfn?rtr>mprfl 


t56 


5.50 


441872 


BE567100 


Hs 154938 


hvnoihetjcal orotaln MDS095 

liy^MMJ leUwOI ^rwlOil "lUywW 


1.56 


2.30 


415503 


U36601 


Hs 78473 


M^aacatvtasfi/N-siilfo^ransfefasfl /hanara 


1.56 


2.56 


451743 


AW074266 


Hs 23071 


ESTs 


1.56 


1.85 


423184 


NM 004428 


Hs.1624 


ephrin-A1 


1.56 


141 


408041 


AW1 38782 


Hs 943S07 


ESTs 


1.56 


2.21 


416777 


AF14G760 


H«;7Q844 


DKF7P5G4M141R nmtpin 


1.56 


ZOO 


428013 


AF15M9fl 


Ue 1fi14Ad 


hi/nnfhpffcal nrnloln 
iijpuuiBuwai piutciii 


1.56 


1.53 


410072 


RF3BAiA7 

DC«J0 4 r i T J T# 


ns. iouj*i 


hvnnthptirsl nmWn Mfini^lflR 
iiypuuicuuoi jjiuicui ivtuvio loo 


1,55 


1.52 


41 1495 


ATUUU03J 


ns. r U 003 


tMAAU 1 00 pXOlcUl 


1.55 


2.88 


Anfl1fi9 
*moio< 


AAJS00OO 


Lie 11R597 


CO IS 


1.55 


9 7A 


*t I003U 


UW330 


He 75W7 


HAinipiQA*oSSuciawU"^us-mpr%stM i- 


1.55 


1.99 




MAJU&U43 


U e 'SI 1Q1 
nS.O ' IO I 


nOf NO OPpieJlS UUIvv r LJZOtOU IIS/ dUiK3 V 


1.55 


1.60 




At |f>7fiQfi1 
AUUf 030 1 


Ue 1CR919 

ns. ioocii 


molhulmolrinul PnoriTump A mulaep 

iiftjuiyiiiioiunyi v^uci liyiiio a rnuiaso 


1.55 


2.57 


A9MRA 


OC97QOQP 
C-Ctr O&OO 


ns. 1 00U40 


LUuicioJi wan group \Auuejyta o oiupycii 


1.55 


1.45 


1IBUI1 




nb.D300£ 


nliifnfhlnnp ^.trancfpr9<:n A 9 
yiUldUllunc 0~UalioiUidSU nt 


1.55 


2.77 




AVVU3U0QO 


He; 975711 


hvnn^haKrjH nrnlain M(?r9d59 
nypuuKsuboi piuiciii iviov£HO£ 


1.55 


Z76 


xnonnfi 

4UyoUO 


AiA/4nftQfin 
Awouuyou 




gD.uwr«orup*aiy"i/ , vi'0-ui.r i iNin_jviv3w_3 


1.55 


2.45 








Tamo) Fvnn 


1*54 


2.58 




All 34UI 1 


ns. f Oto 


CO (5 


1.54 


1.00 


4IUUU I 


ADW 1 U JO 


u e R7771 
no, Of I f 1 


trallllf rpin 1 1 


1*54 


0.62 


4U/01 3 


Al 1909 A7 
AL ItUtMf 


Utc AH1 HQ 
ns.**u 1U3 


klAAflfl79 nmfptn 
maauo f l proicin 


1*54 


2.33 


415906 


AI75H57 


Up 9PQ7d1 


Homo saniens cDMA' FL 12?? 56 Ms clone H 


1.54 


2.77 




AA417nR3 


Ur mA7RQ 
ns. iu<t» 03 


ESTs 


1.54 


2.60 


40(010 


AA003U/0 


U e 1P7n^7 

ns. io/uoo 


COI5 


1.54 


2.48 


413U4;! 


KJR7TJA 
INQf 031 


ns.ou too 


CO 18 


1.54 


2.57 


4«0 10 


U 10«0 


Ue 11C17R 

ns. i 131/0 


1 -Cell lyfTipnOiiia invdolvn aTIU niolaSidSJS 


1.54 


2.57 


410f 10 


AWM1191 
AVVOOI 111 . 


Ue 7CAQ7 

ns. /o**3# 


Hamn canlAn^ rHMA' FI 1991 39 Rc rJnnp H 
nutiiu sa^Mciis wuiw ruiu ioj list uwc n 


1.54 


1.98 


AA7\AA 

44/ w 


AIRV175Q 
AI00U(33 


Ur 17AR1 

ns. i/idi 


Wnmn eanipnQ rJnnp 9/tRflfi mRN A ^pnnpnpp 
nmiiu sapiens wuuc **touv uir\iv> oc^ucn\/o 


1.54 


2.48 


438924 


DtOOOOll 




Ue^smernorarie uaiiicwrty protein 


1.53 


3.08 


449100 


AI030I 10 


nS.14« 101 


to IS 


1.53 


2.08 


4l4Uf J 


AT UOO tJJ 


ns.r of j/ 


nprirpntrinlar mataria) 1 


1.53 


1.70 


4UZo7o 






TArnnl CyAfl 

i aryei ciMjii 


1.53 


2.83 


431010 


AAty04D4 


Ue R19RC 

ns.onoo 


CCTe fJnrtprAMv similar fn <^7flfi HUMAN P 
tola, fvU/UcJcuay sninioi KJ V3/oo_nuivirti» r 


153 


1.60 


4&UO/4 


AAOSf 3**U 


Ur Rft9Q'l 


U<vnn eantonc rlnnn 199AR9 tinknnwn mRNA 


1.53 


3.59 


402617 






Pi (Vnwi >nilRR7R5Q?lrafIMP 0^547 1 i win 
\j lUUJOO l.yiJOOf OOSOIIolllXr^ .VIOOOH/ . I| Will 


1.53 


2.75 




R70292 


Ue Infill fl 

ns. i od i iv 




153 


1.01 


410573 


ArlOlUOf 


Ue C/ICOC 
nS.0433D 


ts*m\f%j\*+jt\tv*ili± ettfwIglrfpiwHft Hphxinmnana^p. 
anuno30ip3i^serni«ucnyuB uciiyuiuyyjiaau- 


1.53 


1.23 


q/oooa 


AA17fidflQ 
AHO/OHUS 


Ue inpR9 
ns. luooz 


Hnmft <anipris cDNA' FLJ23313 fis clone H 


1.53 


0.67 


A1AAAZ. 
404440 


AJOH30UO 


Ue 117R9 

ns. 1 1 / at 


CCfe 
CO IS 


1,53 


2-80 




AV¥ I DUO 33 


ns.ouj/o 


nypoinsucai piuiein 


153 


201 


4 AW DO 


Ai nnn97R 

ALUOUi/O 


Ue 7fURn 


ctmilor 1a nmltan/nfln-ivrtfi Has*; 1 oenl 


1.53 


2.25 


4J74U4 


AAOOOa/4 


Ur iftnacrt 
nS.10U33^ 


FQTe 
Colo 


1.53 


2.00 




AvVI/OlOu 




/ThRP9-RTn914JllMoo.001-E07 BT0214 Homo 


1.52 


3.20 


446457 


AlOUUDOv 


HS.o4o/ol 


CCTe MnHprainlw ctmtlar fn At U1 HUMAN A 
co IS, wioueraiuiy &«iuiar io alu i^numnn n 


1.52 


2.35 


441466 


AWOf OUO 1 


Ue C/090 


CCTe 

cols 


1.52 


1.99 


421810 


AUmi710 
MiUJUlf 10 




hunnfholl^l nrrrfpln FI M0fl5fi 

nypouicucai proiem ru ivooo 


152 


2.98 


447769 


AW873704 


Hs.320831 


Homo sapiens cDKA FU 14597 fis, done NT 


152 


2A7 


414882 


D79394 


Hs.77546 


Homo sapiens cONA: FU21983 fis, clone H 


1.52 


2.55 


442169 


W21813 


HS.B125 


Homo sapiens mRNA cDNA DKFZp586E1 521 (f 


1.52 


1.31 


404349 






Target Exon 


1.52 


Z74 


416278 


AA35635B 


Hs.79137 


protein-Usoasparlate (D-aspartale) 0-m 


1.52 


2.93 


431846 


BE019924 


Hs.271580 


uroptaWnlB 


152 


1.01 


431958 


X63629 


Hs.2877 


cadharin 3, type 1, P-cadherin (placenta 


1.52 


0.93 


442670 


BE410050 


Hs,11859 


hypothetical protein FU13188 


152 


270 


441617 


AA5818G3 


Hs.17B48$ 


Homo sapiens cONA FU13919 fis, done Y7 


1.52 


1.65 


440079 


A1557284 


Ks.6900 


ring finger protein 13 


152 


1.76 


432831 


AJ821702 


Hs.115959 


ESTs, Weakly similar to 138022 hypotheO 


1.52 


2.13 


414320 


U13616 


Hs.75893 


ankyrin 3. node of Ranvier (ankyrin G) 


152 


213 


442149 


AB014550 


HsJ118 


K1AA0650 protein 


152 


1.00 


457747 


AW975000 




gb:EST387105 MAGE resequences, MAGN Homo 


1.51 


238 



204 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



419433 


AA814807 


Hs.7395 


431612 


AA515902 


Hs 130650 

i W» 1 O WWU 


415477 


NM.002228 


Hs.78465 


4475B0 


AI953360 


Hs. 133487 


416926 


H03109 


h<; 108920 


442755 


W57656 


Hs 1 09701 


448694 


AA47B756 


Hs. 194477 


422675 


BE01B517 


Hs 119140 

lld> 1191 *tU 


404397 






412927 


AA284018 


Hs.75083 


402371 






431730 


AF208856 


Hs 268122 


417715 


AW9695B7 




451117 


AA015752 


Hs 205173 


434727 


H43374 


Hs.7890 


442297 


NM 0D6202 


80901 


425683 


AL 137708 


Hs 161031 


452658 


N88604 


Hs 30212 


428695 


AI355647 


Hs 189999 


438967 


H30340 


Hs 173705 


419847 


AW39Q601 

ril » ww www l 


Hs.184544 


431369 


BE1 84455 


Hs 251754 


433265 


AB 04 0971 


Hs 35096 

1 MiWW'VW'V 


408136 


ALC41 135 


Hs.42959 


455485 


AA1022B7 


Hs.26756 


418863 


AL1 35743 


Hs 255S6 


405193 






408948 


AW296713 

nMLwyi Iw 


Hs. 221 441 


426086 


AF038007 


Hs 166196 


405932 






454034 


NM.O0O691 


Hs.575 


422355 


AW403724 


Hs 3QD697 


428044 


AA093322 


Hs 301404 


416166 


AW501907 


Hs 281734 


430453 


RP387060 


Hs.3903 


401600 


RF247275 






AIM7717 
nlUlr 1 1 1 










416179 


R 1 90 1 5 


Ue 7Qnfi7 


450272 


A1075170 


Hs 20010 


/II "57/10 


DC (3000/ 






nnSUf Of 0 


Ue 3RA3fi1 
ruj.cOO.JU 1 


410418 


D31382 


Hs 63325 


/■3ecee 


BE545586 


Hs 278712 


*tw9/Dj 






4201 V) 


RP07QR47 


H«; 301914 


44JH5R 


AL1 20837 


Hs 20993 


IDtO 


AU213J7 








H«; 98135 


440029 


IN £03 03 


Hs 22891 






H<; 111RR7 


•WO/ l<3 


n 1 Q9U£ 










419125 


A 

AAW*4D4 


ns. 1 ouoo 1 






Hs.920 






H<; 107882 


HINDOO 


S78296 


Hs 76888 


497ftAR 


fflHwlW 


Hs 173328 


*WOO*K> 




H* 2R192 




AM9fiflAi; 


Hs.97726 


i44yn 


A1R7Q701 


Ue 145409 




f\i litoy 1 


ns. 1 0 too 








453827 


AF201948 


Hs.35660 


423599 


AJ805654 


Hs.31731 


410691 


AW239226 


Hs.65450 


430688 


AL022101 


Hs.104991 


438083 


AI949940 


Hs.1 21924 


430713 


AA351647 


Hs^642 


437325 


AF142481 


Hs.5548 


403342 






438808 


M73980 


Hs.129053 


446493 


AK001389 


Hs.15144 


414895 


AW894856 


Hs.1 16278 


442072 


AI740832 


Hs.12311 


425723 


NM.014420 


Hs.159311 


432901 


A1554929 


Hs.281866 



hypothetical protein FU 231 82 
ESTs 

v-Jun avian sarcoma virus 17 oncogene ho 
ESTs 

HT01 8 protein 

ubiquHin-Gke 5 

E3 ubtquIUn (igase SMURF2 

eukaryotic translation initiation factor 

ENSP00000251675':K1AA1550 protein (Fraflm 

human irnmunodeficiency virus type 1 enha 

Target Exon 

hypothetical protein 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1671 protein, 
phosphodiesterase 4 A, cAMP-speriftc (dun 
Homo sapiens mRNA; cDNA DKFZp434K0322 (f 
thyroid receptor interacting protein 15 
purinergic receptor (family A group 5} 
Homo sapiens cDMA: FU 22050 8s, done H 
Homo sapiens, clone IMAGE:3355383, mRNA, 
secretory leukocyte protease Inhibitor ( 
K1AA1538 protein 
KIAA1012 protein 
hypothetical protein FU20896 
ESTs. Weakly similar to 2004399A chromes 
C7000789:gil1943947|gblAAC48716.1| (U901 
ESTs 

ATPase, Class ), type 8B, member 1 
C15000305:gij3806122lgb[AAC69198.1l (AF0 
aldehyde dehydrogenase 3 family, member 
coagulation factor VII (serum prolhrombi 
RNA binding motif protein 3 
Homo sapiens cDNA: FU 22807 6s, done K 
Cdc42 effector protein 4; binder of Rho 
U5 snRNP-specific protein. 1 16 kD 
chromosome 21 open reading frame 15 
C7000789:gi|f943947[gb|AAC48716.1| (U901 
MAD (mothers against decapentaptegtc, Dr 
ESTs 

gb:CM0-HT0395-280100-169-b09 HT0395 Homo 
Homo sapiens cDNA: FU 22696 fe, clone H 
transmembrane protease, serine 4 
Homo sapiens cONA FU 1 1074 fis, done PL 
NM_00703r:Homo sapiens adisintegrin4i 
gb:RC6-BT0627.22030(M)t2-D06BT0627 Homo 
high-glucose-fegutated protein 8 
hypothetical protein FU 1081 3 
hypothetical protein FU 20559 
solute carrier family 7 (catlonic amino 
K1AA059Q gene product 
ESTs 

NM_005000':Homo sapiens NADH dehydrogena 
B-cell CLUIymphoma 1 1 A (zinc finger pro 
ESTs 

hypothetical protein FU10659 
hypothetical protein MGC12702 
protein phosphatase 2, regulatory subuni 
ESTs, WeaWy similar to ALU1.HUMAN ALU S 
ESTs 

RAB, member of RAS oncogene family-like 
ESTs 

C2000428*:giI7705383lreflNP - 057536.1| 6C 
BUP protein 
peroxiredoxin 5 
reticuton4 

hypothetical protein similar to preferen 
ESTs 

eufcaryotic translation elongation factor 
Wxjx and leudne-rich repeat protein 5 
Target Exon 

Homo sapiens NOTCH 1 (N1) mRMA, complete 
hypothetical protein DKFZp564O043 
Homo sapiens cDMA FU 13571 fis, clone PL 
Homo sapiens done 23570 mRNA sequence 
dickkopf (Xenopus laevis) rwmolog 4 
ATPase. H transporting , lysosomal (vacuo 

205 



1.51 


2.50 


1.51 


1.64 


1.51 


2.62 


1.51 


2.02 


1.51 


2.22 


1.51 


1,34 


1.51 


2.24 


1.51 


1.49 


1.51 


2.18 


1.51 


1.33 


1.51 


3.22 


1.51 


1.57 


1.51 


1.59 


1.50 


2.70 


1.50 


153 


1.50 


2.24 


1.50 


1.13 


1.50 


1.62 


1.50 


1.00 


1.50 


1.05 


1.50 


2.53 


1.49 


0.97 


1.49 


1.44 


1.49 


2.24 


1.49 


2.40 


1.49 


2.84 


1.48 


2.20 


1.48 


2.20 


1.48 


2.24 


1.48 


1.48 


1.48 


1.16 


1.48 


1.20 


1.48 


Z38 


1.48 


1.28 


1.4B 


2.73 


1.48 


2.53 


1.48 


2.03 


1.48 


2.00 


1.48 


1.25 


1.48 


Z35 


1.48 


2.08 


1.48 


1.00 


1.47 


1.91 


1.47 


2.26 


1.47 


1.24 


1.47 


2.00 


1.47 


2.90 


1.47 


4.08 


1.47 


3.13 


1.47 


1.06 


1.47 


1.51 


1.47 


2.39 


1.47 


1.00 


1.46 


2.20 


1.46 


2.13 


1.46 


1.98 


1.48 


1.45 


1.46 


2.31 


1.46 


1.30 


1.46 


2.48 


1.46 


1.05 


1.46 


2.12 


1.46 


2.40 


1.46 


1.65 


1.46 


1.56 


1.46 


1.49 


1.46 


Z45 


1.46 


ZOO 


1.45 


1.60 


1.45 


1.26 


1.45 


2.21 


1.45 


Z40 


1.45 


3.65 


1.44 


Z71 


1.44 


1.08 


1.44 


Z24 


1.44 


1.63 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



412210 


AW901492 




421685 


AF189723 


Hs, 106778 


428115 


AB023194 


Hs.300855 


442358 


BE567985 


Hs. 18585 


414685 


L39874 


Hs.76894 


413798 


AA336708 


Hs.75546 


410937 


AA218564 


Hs.67052 


400397 


AJ270770 




405902 






433976 


AA6209B7 


Hs.1 90268 


405376 






436086 


Z43133 


Hs.9961 


418162 


AW0 16405 


Hs.16648 


430307 


BE513442 


Hs.238944 


434924 


AA443164 


Hs.23259 


417821 


BE245149 


Hs 82643 


404744 






405418 






402869 






451608 


AA384525 


Hs.26745 


424099 


AF071202 


Hs.1 39336 


401041 






417839 


AI815732 


Hs.82712 


409245 


AA361037 


Hs 288036 


447808 


NM-007265 


Hs.1 9873 


456492 


AA330047 


Hs.1 91 187 


449244 


AW859979 


Hs.32204 


413094 


H24184 


Hs.25413 


452407 


AA682909 


Hs.29353 


407674 


AW064061 


Hs779145 


441297 


AW403084 


Hs.7766 


421932 


W51778 


Hs.323949 


426348 


BE466566 


Hs.1 7433 


432554 


AI479813 


Hs.278411 


431735 


AW977724 


Hs.75968 


429953 


NMJXM376 


Hs.226581 


444037 


AV647686 


Hs.42733 


402144 






456758 


AA325170 


Hs 224527 


452322 


BE566343 


Hs 28968 


426863 


AL137657 


Hs.172803 


410684 


AA088500 


Hs.170298 


401784 






427523 


BE242779 


Hs 179526 


449269 


A! 554682 


Hs 175B70 


406467 






444339 


T96555 


Hs 31562 


431563 


AJ027643 


Hs 120912 


413343 


BE392026 


Hs 334346 


447537 






42821 1 


AA42421 1 


Hs 183176 


406248 






437412 


BF069288 


Hs.34744 


414653 


M24486 


Hs 7676B 


403885 
439459 


AF0RG779 


Hs.58013 


419075 


TB4266 


Ha 123927 


405022 






401346 


RF041451 




415660 


AIQMfiOJ 

niJuJUv < 




448023 




Hs 17D3R8 

TO. 1 » IMW 


435962 




ns.L? ityt 


432480 


AA205475 


Hs.275865 


414309 


AK000639 


Hs.75884 


440256 


U23841 


Hs.18851 


413809 


U5851 


Hs.851 


408176 


AK001553 


Hs.43436 


433960 


AW629188 


Hs.188929 


404178 






402449 






455604 


BE011183 




429221 


AJ821060 


Hs.198271 


422122 


AA383642 


Hs.1 11894 


406231 






405879 






450936 


AI033745 





403381 



gb^CO-NNl012-27030W)31*10 NN1012 Homo 
ATPase, Ca transporting, type 2C, member 
WAA0977 protein 

ESTs, Moderately similar to ALU4 JWMAN A 
dCMP deaminase 

capping protein (actin filament) muscle 
vacuolar protein sorting 26 (yeast homd 
transcription factor 7-fike 2 (T-cefl sp 
Target Exon 
ESTs 

Target Exon 

Homo sapiens cDMA; FU21954 fis, clone H 
ESTs 

hypothetical protein RJ 10531 
hypothetical protein FU 13433 
protein tyrosine kinase 9 
Target Exon 
Target Exon 
Target Exon 
hypothetical protein 

ATP-Wnding cassette, suWamily C (CFTR 
C1100(M253i|4507721|reflNP„0033iai| ti 
fragile X mental retardation, autosomal 
IRNA feopentenyipyrophosphate transferas 
suppressor of S. cerevisiae gcr2 ■ 
ESTs 
ESTs 

TOLUP protein 

brain-specific protein p25 alpha 
ESTs 

ubiqui&TMHfljugarjflg enzyme E2E 1 (homo 
kangai 1 (suppression of lumorigenichy 
hypothetical protein FU 20967 
NCK-associated protein 1 
thymosin, beta 4, X chromosome 
COXf 5 (yeast) homolog, cytochrome c oxid 
CHMP1 .5 protein 
Target Exon 

ESTs, Weakly similar to FAHUAA alpha-act 
glutaredoxln (thioltransferase) 
hypothetical protein MGC10327 
ESTs 

NM_002280*:Homo sapiens keratin, hair, a 
upregulated by 1,25^'hydroxyvitamin 0-3 
ESTs 

Target Exon 

ESTs 

ESTs 

hypothetical protein MGC13045 

ESTs, Weakly similar to AF193556 1 sacs) 

ESTs 

Target Exon 

Homo sapiens mRNA; cDNA DKFZp547C136 {fr 

proconagen-proline, 2-oxoglutarate 4-di 

Target Exon 

ESTs 

ESTs 

Target Exon 
hypothetical protein 

ubiquitin-ccnjugating enzyme E2G 1 (homo 

ESTs 

ESTs 

ribosomal protein S18 
DKFZP588A011 protein 
hypothetical protein FU 10875 
tntegrin, alpha E (antigen CO103, human 
adenylate kinase 3 alpha like 
ESTs 

C600l430*^|4503521|reflNP_001559.1| mu 
Target Exon 

gb:PM3-BN0218-100500-003409 BN0218 Homo 
Target CAT 

lysosomal-asscctatBd protein trans membra 
Target Exon 
Target Exon 

gb:ow23a10jc1 SoaresjjaralhyroidJumorJ* 
ENSP00000231844':Ecotropic virus Integra 

206 



4 AA 

1.44 


2.15 


4 AA 

1.44 


1.83 


4 AA 

1.44 


1.31 


1.44 


2.47 


1.44 


1.25 


1.44 


1.26 


1.44 


1.41 


1.44 


3.43 


1.44 


2.65 


1.44 


2.46 


1.44 


2.28 


1.44 


1,34 


1.44 


2.35 


1.43 


1.55 


1.43 


2.05 


1.43 


2.15 


1.43 


1.99 


1.43 


2.83 


1.43 


2.40 


1.43 


1.22 


1.43 


2.45 


1.43 


290 


1.43 


2.84 


1.43 


2.65 


1.43 


2.00 


1.43 


2.73 


1.42 


1.57 


1.42 


1,33 


1.42 


2.50 


1.42 


2.35 


1.42 


2.20 


1.42 


1.48 


1.42 


1.83 


1.42 


2.46 


1.42 


1.30 


1.42 


1.50 


1.42 


1.38 


1.42 


2.38 


1.42 


223 


1.42 


2.16 


1.41 


1.38 


1.41 


1.28 


1.41 


1.37 


1.41 


1.32 


1.41 


1.37 


1.41 


1.80 


1.41 


2.94 


1.41 


1.41 


1.41 


1.21 


1.41 


2.07 


1.41 


1.25 


1.41 


Z40 


1.41 


1.39 


1.41 


1.33 


1.41 


2,58 


1.41 


208 


1.41 


2.84 


1.40 


2.55 


1.40 


2.38 


1.40 


2.38 


1.40 


2.38 


1.40 


210 


1.40 


1.37 


1.40 


1.18 


1.40 


1.91 


1.40 


280 


1.40 


2.73 


1.40 


1.99 


1.40 


283 


1.40 


1.51 


1.40 


230 


1.40 


1.22 


1.40 


1.42 


1.40 


260 


1.40 


273 


1.40 


1.13 


1.39 


6.03 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



453258 


AW293134 


Hs.32597 


448261 


BE244072 


Hs.20815 


427666 


A1791495 


Hs.180142 


413859 


AW992355 


H&8364 


407704 


BE315072 


Hs.78768 


430138 


AA936295 


Hs.234265 


432841 


M93425 


Hs.62 


444051 


N48373 


Hs.10247 


440704 


M69241 


Hs.162 


450092 


AW139606 


Hs.221057 


400275 






403725 






443211 


AI128388 


Hs.143655 


421510 


AKD00919 


Hs.105f9f 


430071 


AA3559B6 


Hs.232058 


451545 


A1802128 


Hs.208647 


439897 


NM_015310 


Hs.6763 


423872 


AB020316 


Hs.134015 


410344 


AW978436 


Hs.62515 


404439 






448581 


NM_002709 


Hs.21537 


408559 


BE066047 


Hs.86412 


447643 


H10767 


Hs.238465 


401593 






403807 






406356 


N47812 




401886 






421110 


AJ250717 


Hs.1355 


427449 


AW946384 


HS.17B112 


427451 


A1690916 


Hs.178137 


440661 


AW449696 


Hs.166547 


419590 


AF005043 


Hs.91390 


446044 


H67567 


Hs.13572 


400967 






414506 


AF075337 


Hs.76293 


402599 






422932 


AI191813 


Hs.308220 


433889 


AK002082 


Hs.3623 


429802 


H09548 


Hs.5367 


443856 


AK000574 


Hs.9908 


453489 


AA300067 


Hs.33032 


424670 


W61215 


Hs.1 16651 


428995 


AW004975 


Hs.194716 


441551 


AA318224 


Hs.296141 


450528 


NM.014072 


Hs.25063 


427605 


NM.000997 


Hs.337445 


459237 


AA031675 


Hs.31917 


413691 


AB023173 


Hs.75478 


404906 






436246 


AW450963 


Hs.1 19991 


441478 


AA350018 


Hs.301342 


419715 


AF070523 


Hs.92384 


426251 


M24283 . 


Hs.168383 


400129 






450447 


AF212223 


Hs.25010 


434697 


AL133033 


Hs.4084 


430308 


BE540865 


Hs.238990 


434767 


AF153201 




459729 


AL037285 


Hs.289848 


426653 


AA530892 


Hs.171695 


408912 


AB011084 


Hs.48924 


409844 


AW502336 




402517 






447042 


AB035863 


Hs.182217 


405000 
452065 


AK000360 


Hs.27721 


404666 






451081 


AI078645 


Hs.431 


427979 


BE379776 


Hs.181309 


435825 


R16702 


Hs.91147 


426469 


BE297886 


Hs.293970 


447002 


BE242866 


Hs.16933 


410946 


AW811502 




454383 


AW500332 


Hs.11114 


440512 


AAB87845 


Hs.19673 


409865 


AW502208 




447390 


X95384 


Hs.18426 



ring finger protein (C3H2C3 type) 6 
macrophage erythrobtast altacher 
catmodulin-like skin proleln (CLSP) 



malignant cell expression-enhanced gene/ 
DKFZP586G011 protein 
protein tyrosine phosphatase, norwecept 
activated leucocyte cell adhesion molecu 
insulin-tike growth factor binding prate 
ESTs, Weatfy similar to ALULHUMAN ALU S 
NMJ)05513*:Homo sapiens seryWRNA synth 
Targe) Exon 
ESTe 

hypothetical pcolein FU1Q057 
transcription factor 8 {represses interl 
ESTs 

K1AA0942 protein 
uronyl 2-su!fotransf erase 
K1AA0494 gene product 
ENSP00000067222*:Mitochondnai 28S ribos 
protetn phosphatase 1 , catalytic subunit 
chromosome 9 open reading frame 5 
nGAP-fike protetn 
Target Exon 

NMJJ31889:Homo sapiens enametin (ENAM), 
CGI-35 protein 

NM_021783:Homo sapiens XEDAR (XEDAR), mR 
cathepsin E 

DNA segment, single copy probe LNS-CAl/l 

transducer of ERBB2.1 

ESTs 

poly (AOP-ribose) glycohydrolase 
calcium modulating ligand 
Target Exon 
thymosin, beta 10 

NM_021 1 86*:Homo sapiens zona pellucida g 
ESTs 

hypothetical protein FU11220 

ESTs, WeaWy similar to 138022 hypotheti 

nitrogen fixation cluster-like 

hypothetical protein DKFZp434N185 

epithelial V-iike antigen 1 

MAD (mothers against decapentapSegtc, Dr 

ESTs 

PRO0461 protein 
rfbosomal protein L37 

Homo sapiens, done MGC:9658, mRNA, comp 
ATPase, Class V!, type11B 
NrVL025213:Homo sapiens spectrin, beta, n 
ESTs 

hypothetical protein MGC4342 
vitamin A responsive; cytosketeton relat 
intercellular adhesion molecule 1 (CD54) 
Eos Control 

hypothetical protein P15-2 
WAA1D25 protein 

cyciin-dependent kinase inhibitor 1B (p2 
C2H2 (KruppeRype) zinc finger protein 
EST, Weakly similar to ALU4_HUMAN ALU SU 
dual specificity phosphatase 1 
K1AA0512 gene product ALEX2 
gb:UI-HF-BR0p-aka^5-04Jl.r1 NIH JAGC.5 
Target Exon 

succinate- CoA iigase, ADP-forming, beta 
Target Exon 

Wolf-Hirschhorn syndrome candidate 
C9000748:gtj8324209|gb|AAB34384.2| {S775 
murine leukemia viral (bmM) oncogene h 
proteasome [prosome, macropain) subunit 
ESTs 

nTethytrnaionate-serniaidehyde dehydrogenas 
HepA-retated protetn 

gb:OV2-ST014$461299-01SU>4 ST0145 Homo 
hypothetical protein dJ1 1 81 N3.1 
suppressor of S. cerevisiae gcr2 
gb:UJ-HF.8R0p-aju-£M)9-O-Ui.r1 HlHJJIGCJ 
translaflonal inhibitor protein p14.5 

207 



1.39 


3.20 


1.39 


1.33 


1.39 


2.30 


1.39 


1.53 


1.39 


1.34 


1.39 


2.38 


1.39 


2.88 


1.39 


1.34 


1.39 


1.61 


1.39 


2.78 


1.39 


2.03 


1.39 


2.03 


1.39 


1.83 


1.39 


2.83 


1.38 


3.54 


1.38 


2.21 


1.38 


3.65 


1.38 


1.00 


1.38 


2.25 


1.38 


2.25 


1.38 


1.47 


1.38 


1.27 


1.38 


1.22 


1.38 


2.58 


1.38 


Z38 


1.38 


2.25 


1.38 


2.00 


1.38 


8.93 


138 


1.44 


1.38 


2.81 


1.38 


2.95 


1.38 


2.10 


1.37 


262 


1.37 


3.12 


1.37 


1.18 


1.37 


2.68 


1.37 


2.38 


1.37 


2.23 


1,37 


2.25 


L37 


1.28 


1.37 


2.15 


1.37 


1.66 


1.37 


1.33 


1.37 


2.95 


1.37 


1.19 


1.37 


1.31 


1.37 


250 


1.37 


1.31 


1.36 


3.06 


1.36 


1.00 


1.36 


1.43 


1.36 


1.28 


1.36 


216 


1.36 ■ 


203 


1.36 


213 


1.36 


201 


1.36 


2.03 


1.36 


2.87 


1.36 


1.27 


1*35 


220 


1.35 


268 


1.35 


229 


1.35 


210 


1.35 


1.25 


1.35 


232 


1.35 


236 


1.35 


255 


1.35 


1.70 


1.35 


223 


1.35 


239 


1.35 


'1.40 


1.34 


268 


1.34 


202 


1.34 


213 


1.34 


205 


1.34 


263 


1.34 


1.00 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



450233 


N36754 


Hs.171118 


445831 


NM_006055 


Hs.13351 


418610 


AW245993 


Hs.223394 


441946 


AW298715 


Hs.120775 


446192 


H49944 


Hs.14231 


416285 


BE537973 


Hs.48617 


425590 


AI954686 


Hs.158321 


407498 


U28131 




441331 


A1216754 


Hs.149971 


411789 


AF245505 


Hs.72157 


420542 


NM000505 


Hs.1321 


413892 


AI878921 


Hs.75607 


439750 


AL3S9053 


Hs,57664 


414861 


AL1 19396 


Hs.77508 


421687 


AUJ35306 


Hs.106823 


410846 


AW807057 




443937 


R66571 


Hs.24601 


432360 


BE045243 


Hs.274416 


443119 


AA312264 


Hs.7980 


438464 


AA669735 


Hs.324743 


401371 






405443 






453764 


BE008180 


Hs.282846 


424924 


AL039103 


Hs.153834 


453555 


N23574 


Hs. 123649 


404343 






412383 


AW947577 




404250 






413899 


AF083892 


Hs.75608 


422716 


AI702835 


Hs.1 24475 


448862 


AI351979 


Hs.152717 


409540 


AW409569 




431186 


NMJJ12249 


Hs.250697 


402754 






420798 


W93774 


Hs.99936 


459710 


AI701596 


Hs.1 21592 


435192 


AK000739 


Hs.4835 


401383 






453394 


AW960474 


Hs.40289 


421820 


AW662990 


Hs.294133 


444047 


AI097452 


Hs.1 35095 


440860 


R10482 


Hs.132876 


425808 


AA364109 


Hs.177990 


456558 


BE410992 


Hs.258730 


447016 


AB033029 


Hs.16953 


414015 


AA340987 


Hs.75693 


414843 


BE386038 


Hs.77492 


424058 


AL121516 


Hs.138617 


401196 






450147 


AW373713 


Hs.146324 


422699 


BE410590 


Hs.1 19257 


405172 






434087 


AF1 16675 


Hs.334476 


416720 


H05435 


Hs.1 11 10 


426621 


NM_001329 


Hs.171391 


442685 


AB033017 


Hs.8594 


443879 


Z28462 


Hs.9927 


405180 


NWL002649 




417365 


050683 


Hs.82028 


402087 






429323 


NMJX)1649 


Hs.2391 


409935 


AW511413 


H3.27B025 


430235 


BE268048 


Hs.236494 


400172 






421742 


AW970004 


Hs.107528 


404273 






416204 


AW972270 


Hs.144054 


435076 


AW298113 


Hs.92909 


452497 


AA7321S3 


Hs.27865 


404598 






419060 


AW150835 


Hs.18878 


427195 


W27230 


Hs.173912 


438129 


AA778547 




402138 






404029 






402731 


AUM2818 




455766 


AW183518 


Hs.55610 



hypolhelica] protein FU00026 
UnC (bacterial lantihtoUc synthetase c 
hypothetical protein MGC2742 
ESTs 

selenoprotein W, 1 

Homo sapiens cDNA FU12540 fis, clone NT 
beaded filament structure) protein 2, ph 
gb:Human HMGi-C chimeric transcript mRNA 
ESTs. Moderately similar to ALUB.HUMAN ! 
Adiican 

coagulafion factor XII (Hageman factor) 
myristoylated aianine-rich protein kinas 
Homo sapiens mRNA full length insert cDN 
glutamate dehydrogenase 1 
hypothetical protein MGC14797 
gb:MR4-ST0062.031199-018-b03 ST0062 Homo 
ESTs 

Target CAT 

hypothetical protein MGC12966 
protein phosphatase 4 regulatory subunil 
ENSP00000198192*:BA438F9.1 (novel protei 
Target Exon 

Homo sapiens cDNA FU14353 fis, clone V7 
pumilio (Drosophila) homolog 1 
ESTs, Moderately similar to ALU7.HUMAN A 
C7O02M *:gi)5053028}gblAADmi J. )\tf)5S 
gb:ROW0004-1403Q0-031-b09 MT0004 Homo 
Target Exon 

tight junction protein 2 (zona occludens 
ESTs, Weakly similar to YEF4.YEAST HYPOT 
hypothetical protein FU13725 
gb:fh01e09.x1 NIHJAGCJ 7 Homo sapiens cD 
ras-tfce protein 

NMJ)22469':Homo sapiens rr/pothelfcal pro 
keratin 10 (epidermolytfc hyperkeratosis 
ESTs 

eukaryotic translation initiation factor 

Target Exon 

ESTs 

heme-binding protein 

ESTs 

ESTs 

ESTs 

heme-regulated initiation factor 2-atpha 
KIAA1203 protein 

prolylcarboxypeptidase (angiotenshase C 
heterogeneous nuclear ribonucleoprotein 
thyroid hormone receptor Interactor 12 
Target Exon 
CGM45 protein 

ems1 sequence (mammary tumor and squamou 
Target Exon 

hypothetical protein PR01942 
hypothetical protein MGC2508 
C-terminal binding protein 2 
KIAA1191 protein 

Homo sapiens mRNA; cDNA DKFZp5640156 (fr 
phosphoinosiiIde-3-klnase, catalytic, ga 
trans forming growth factor, beta recepto 
Target Exon 

apical protein, Xenopus laevls-like 
ESTs 

RAB10, member RAS oncogene family 
Eos Control 

androgen induced protein 

Target Exon 

ESTs 

SON DNA binding protein 

Homo sapiens cDNA: FU21333 fis, clone C 

Target Exon 

hypothetical protein RJ21620 
eukaryotic translation initiation factor 
gb:afB7d03.s1 Soares Jestis.NHT Homo sap 
Target Exon 

NM_018936*:Homo sapiens protocadherin be 

E3ubiquitIn»gaseSMURF1 

solute carrier family 30 (zinc transport 

208 



1.34 


2.45 


1.34 


1.60 


1.34 


1.39 


1.34 


230 


1.34 


1.17 


1.34 


2.22 


1.34 


250 


1.34 


213 


1.34 


205 


1.34 


1.27 


1.33 


1.25 


1.33 


1.41 


1.33 


1.99 


1.33 


1.66 


1.33 


218 


1.33 


207 


1.33 


202 


1.33 


1.12 


1.33 


268 


1.33 


1.99 


1.33 


1.10 


1.33 


2.11 


1.33 


288 


1.33 


1.24 


1.33 


223 


1.33 


1.04 


1.33 


2.06 


1.33 


253 


1.33 


2.81 


1.33 


230 


1.33 


1.08 


1.33 


2.18 


1.32 


1.39 


1.32 


1.16 


1.32 


202 


1.32 


270 


1.32 


2.22 


1.32 


218 


1.32 


2.20 


1.32 


1.24 


1.32 


295 


1.32 


283 


1.32 


211 


1^32 


205 


1.32 


1.30 


1.32 


1.39 


1.32 


1.26 


1.32 


201 


1.32 


213 


1.32 


1.32 


1.32 


1.33 


1.32 


211 


U2 


230 


1.32 


218 


1.32 


1.53 


1.32 


1.43 


1,31 


224 


1.31 


1.36 


1.31 


0.98 


1.31 


1.31 


1.31 


205 


1.31 


1.20 


1.31 


1.31 


1.31 


1.05 


1.31 


1.79 


1.31 


235 


1.31 


215 


1.31 


205 


1.30 


1.57 


1.30 


223 


1.30 


1.15 


1.30 


1.34 


1.30 


255 


1.30 


209 


1.30 


283 


1.30 


3.32 


1.30 


1.56 



WO 03/003906 



PCT/US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



434585 


AW451715 


Hs. 184075 


417219 


AW973473 


Hs.220936 


428125 


AA393071 


Hs.182579 


416188 


BE157260 


Hs.79070 


444581 


AJ243937 


Hs.288316 


406621 


X57809 


Hs.181125 


436663 


AW410458 


Ks.5258 


417250 


N5B241 


Hs.332115 


434978 


AA321238 


Hs.4310 


448079 


R76981 




450626 


AW1 90989 


Hs.1508 


456059 


BE543127 


Ks.336948 


417809 


H75797 


Hs.233550 


454771 


AW819939 


Hs.273629 


413895 


BE178160 




404649 






440576 
405891 


NM.004987 


Hs.1 12378 


418965 


AJ002238 


Hs.11482 


412824 


AW958075 


Hs.11251 


420037 


BE299598 


Hs.135569 


459221 


BE246522 


Hs.306121 


458651 


AW612481 


Hs.104105 


422984 


W28614 




459365 


BE067754 




418254 


AA732511 


Hs.86650 


402474 






448456 


AI521830 


Hs.171050 


450098 


W27249 


Hs.8109 


405053 






428915 


AI041278 


Hs.87908 


443721 


AW450451 


Hs.265355 


452047 


N35953 


Hs.43510 


440213 


AW246253 


Hs.7043 


452900 


AA626794 




418721 


NM.002731 


Hs.87773 


458911 


AA373131 


Hs.24322 


444250 


R40815 


Hs. 12396 


431631 


AA548906 


Hs.122244 


447966 


AA340605 


Hs. 105887 


430316 


NM.000875 


Hs.239176 


416272 


AA1~78882 




437456 


AL047045 


Hs.60293 


456327 


H68741 


Hs.38774 


403349 


mm 001406 




428821 


H91282 


Hs.286232 


454555 


AW807095 




406872 


AI760903 




401720 






400082 






HC\i 100 


W92885 


Hs 143408 


411579 


AC005258 


Hs.70830 


402191 






457118 




Hs 182469 


4ftRC7fi 

*twOO/ D 


MM fUW>4? 


Hs.46423 


452826 




Hs 301 636 


414909 


RB0316 


Hs, 132569 


416114 


A1695549 


Hs.1 83868 


455476 








AFC54284 


He 334826 


432647 


AI807481 


Hs778581 

nji4 i Www ■ 


405436 






406140 






426201 


AW182614 


Hs.1 28499 


433334 


A1927208 


Hs.231958 


423262 


NM_005479 


Hs.126057 


422929 


AA356594 


Hs.94011 


445605 


AJ906088 


Hs.87159 


425050 


BE391854 


Hs.7970 


420539 


AA282735 


Hs.44004 


437352 


AL353957 


Hs.284181 


455535 


AA305079 


Hs.1342 


434202 


BE382411 


Hs.3764 


439528 


BE613180 


Hs.268358 


400178 






430023 


AA15B243 


Hs.227729 


412841 


AI751157 


Hs.101395 



ESTs, Weakly similar to ALU INHUMAN ALU S 
EST8 

leucine amlnopeptldase 

v-myc avian myelocytomatosts viral oncog 

chromosome 6 open leafing frame 9 

Immunoglobulin lambda locus 

chromosome 1 1 open reading frame2 

ESTs 

euxaryolic translation Initiation factor 

thyroid hormone receptor-associated prot 
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AW190219 AW129532 AJ954133 AI668869 AI383948 A1537386 AA213788 A1088416 A1360826 AA101B02 A1619505 AA101795 AA101723 
AW614392 R64287 AA927599 A1251904 AIB03003 AI675123 AW023079 AW134959 A1926156 AA831069 AI638324 F29901 R46085 F03383 
C17731 R20584 T90131 

A1033745 A1034133 AA844424 AW166024 A1B31699 AI971097 AA01 1685 H70852 H70851 

A^2CTW 8 ^6n9^7 6 1274 AA928041 A1954235 N71035 N70230 AW674412 A1871 136 Al 563955 AI954237 AA649543 A1340231 
A1368586 AI868721 W44486 TB3736 AA126250 AI343619 H94297T47633 AI672897 AA495355 R23240 AI814680 AA9TC11 IS 'M644262 
N67040 AW074273 AI357512AA865354 AI027942 R33837 H95828 N63928 A1418701 A! 1 86469 AA693672 AA778429 AA1 28352 AW954072 

wooisXiSaw^ 

AI383091 AA128493 T83907 T47632 AA0941 1B AA495405 H94191 AW887759 N98454 AA51 2988 A1623761 AW028373 AW249740 
AI187029 AA991733 AI583085 A1336036 060550 D60466 050009 060248 AW262673 A1524080 AA205643 AI969141 AW674989 AA093704 
R81248 R73069 W01019 R23316 C14688 W44485 AA093663 T70157 H57883 R94790 D81287 D81788 AA364340 AA383533 N41706 
BE295144 ST DB1466 060551 R33836 060249 AA354560 W01933 T71478 AA515461 F27984 BE084746 W19084 R81247 AI080252 

AW^243AaSo AA412478 AA82B434 AA814538 A192741 8 A1192435 W52897 AA4436S6 AA031913 AI683306 AA918481 A 18331 4 
D83907 A1206832 AA876122 D83836 083838 082533 AI761290 Al 191 125 All 43749 AW771909 AI241436 A1767267 W56507 AA8477B7 

AA643838 AI493761 AA032024W38849 AA340178AA447052 AA452969 W19369 AA296364 H44229 W58767 C05751 C05835 AI741989 

N98532 AW102617 AA412583 AI922246 W38495AA355375 AA928571 C06275 AA352500 N93132 

AW582568 AW818655 AW81B647 AW818655 AW818637 AW818234 

AW807095 AW807022 AW845880 AW807096 AW807461 AW846116 AW807070 

AW815123AW815138AW815259 

R41084AW875856 

A«aS AVS 

AW9481 21 AW948068 AW948109 AW9481 05 AW948097 AW9481 20 AW9481 08 AW9481 1 1 AW9481 14 AW9481 15 AW948072 AW948083 

BE008015 BE008021 BE008028 BE008023 BE008030 BE008014 
BE011183BE011170BE011333BE011188BE011181 BE011324 BE011161 BE011169 
BE145837 BE145894 
AW975000 AA658945 AA661558 

AWH6^ AW176203AW176181 AW176213 



Pkey. Unique number correspondlnQ to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this coiumn are Genbank Identifier (Gl) numbers. 

sequence of human chromosome 22.' Dunham I. el aL, Nature (1999) 402:489-495. 
Strand- Indicates DMA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



"Dunham 1. el aL" refers to the publication entitled The DNA 



Pkey 

400634 

400750 

400752 

400772 

400773 

400788 

400789 

400835 

400843 

400844 

400845 

400846 

400847 

400957 

400967 

401038 

401041 

401058 

401155 

401177 

401196 

401204 

401286 

401346 

401371 



Ref Strand NLposition 

8567750 Minus 101102-101223.101886-102018 

8119067 Plus 198991-199168,199316-199548 

7331445 Minus 36215-36461 

8131629 Minus 34B96-35021.41078-41197 

8131629 Minus 4411644238,48208-48321 

7342055 Plus 184369-184715 

8307741 Plus 82281-83693 

K Pto ?Ko*Wm^^ 

9188605 Rus 24746-24872,25035-25204 

9188605 Plus 34428-34612 

9188605 Plus 39310-39474 

9188605 Plus 44643-44835 

7705148 Minus 66959-67241 

7770682 Minus 32697-32999 

7232177 Minus 4277-4469 

7232177 Ptus 44750-45076 

8117654 Minus 45226-45414 

9438289 Ptus 31381-31526 

9438503 Minus 62773-63330 

9719673 Plus 33138-33834 

9743388 Minus 33694-33872 

9801342 Minus 147036-147318 

9926605 Minus 12031-13032 

9850602 Plus 80901-81283 
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401383 


6721135 


Minus 


155543-157381 


401496 


7381769 


Minus 


82790-83002 


401512 


7622346 


Pius 


136399-136557 


401563 


8247910 


Plus 


91395-91763 


401572 


8570271 


Minus 


78651 -78839 


401593 


7230957 


Plus 


10388-105721 1293-12358 


401600 


4388746 


Minus 


273R3-27S18 28727-28891 2952R-2Q731 

CI wVirti O I0,£0i £l"uXI9 1 ,£90£0^£9/ J 1 


401603 


7689963 


Minus 


116659-116780 

1 < WWJ3* 1 1 VI U\J 


401613 


4878062 


Plus 


22461 -22B31 


401654 


9097132 


Minus 


64695-64797 


401660 


9100664 


Minus 


173662-174024 

1 f JUUt 1 f HV£"t 


401720 


6468551 


Plus 


7783-8468 


401784 


7249190 


Plus 


148362-148606 149453-149535 149731-149962 


401835 


7139700 


Plus 


142257-142742 


401886 


7229913 


Minus 


79215-79393 


401936 


3808091 


Plus 


46817-46943 


402001 


9501618 


Rus 


68052-68223 


402087 


8117546 


Plus 


1 37069-1 37213, 138678-1 38828,138969-1 39050 


402138 


7704985 


Plus 


14173-15108 


402144 


7242326 


Plus 


115425-115977 


402190 


8576067 


Minus 


76488-76959 


402191 


8576073 


Minus 




402329 


7798735 


Plus 


15833-16180 18419-18715 22507-22624 


402368 


9558577 




47218-47330 48052-48203 

tit 1 0— *l OOU|^OUO£*H0£IA7 


402371 


9558584 


Plus 


R873fi-fi8956 

OOf OO-OQOOO 


402378 


9625333 


Minus 


41 31 2-41 46B 48313-48720 


402449 


3/ SUUf H 




5Qafi7-fifl03Q 82588-82828 63485-6362364923455108 

OOUUf ^jUUO3»U£OOv^£U£O,U0HUO^OU&O,9H7.&i7'VO IvU 


402474 


7547175 


Minus 


5352fi-5382ft 55755.55920 57530-57757 

OOJtO* OOOi 0,00 f 0O*003iU,0l OOLrO/ IOI 


402517 


a* 90 IUQ 


Plus 


17569l 17721 

1 / OOO - 1 f f CI 


402599 


7239RRR 


[J 8 


5R35-5Q87 


402617 


QQ307Q7 


Minus 


R94RR-fi9945 


402712 


8989253 


Minus 


10941-11138 


402731 


9211639 




117913-118004,121110-121211,121327-121457,125478-125623.126540-126663 


402737 

*VJCI Or 


9212184 


Minus 


13358-13552 

I oooo- IOOO& 


402754 


9213730 


Plus 


15345-15852 


402760 


9213669 


Plus 


136829-136952137336-137521 


402812 


Rmo.no 




2502R.25O91 25R44-25920 


/fi2R45 


Q3RQ28R 


Plus 


180451-150517 150788-181009 


4n2RRQ 


R434R43 
O-JOHOHO 


Minus 


13RR3Q-1 39335 

1 0 DDOO- 1 03000 


HUcOol 


RflRRRAd 


D» nUS 


J OHOO-f" 1 OH OHO 


HVoU IU 


O 1 OtOlQ 


nus 


7838^-790^2 


4H314Q 


Q7qqors 

37UJ000 


Plus 


?Rn34-25185 

tOUO*?-«iO IOO 




0*T*tUU£0 




1 IUOOT* 1 1 1 ICC 


4rn32Q 

HUOO&3 


00 10 l£U 


Plus 


OU*» JW3UOOO 


403342 


72334R7 




42312-43750 


huooho 


0CCQ77Q 
00 037 / O 


Minus 


1R7R15-1R8374 


4n33fli 


Q43R2R7 


Minus 


26009-26178 


403463 


0Q2Q53R 


Plus 


102596-102879 


4n~tRi*<; 

-too ODD 


R73R0Q3 




RRRRR-RRR-iQ 


403725 


7534031 




86737-86843 


4fl372R 

HUO/ CO 




Ml 


34481-34671 


HUOOU/ 


6439933 


Minus 


162963-165773 




QRTRCWJ 
aooozu? 


Plus 


1211Q7-12135R 

1 A. i 1 Of " 1 C 1 Jw 


/rnRRt: 
huoooo 


771 Mm 

// IUHUJ 


Minus 


53259-53524 


/fHQ43 


7711RR4 
//II OO** 




100742-100904 101322-101503 


HUoooO 


00/ DUO/ 


rlUS ' 


1R2R1-164B2 


4fWfl2Q 
HUhUIo 


7R71959 
(Or lto£ 


Plus 


108716-111112 


hUhUov 


OO40/O0 


Plus 


00307-101808 

OO JO I Ivl own 


hU-hJoo 


11 RRR1Q 
J 1 000 1 □ 


Plus 


47310-47450 


4U41/CJ 


fO 003/0 


Minus 


17RIY75-17R3R3 

i / OU/ O- 1 * DOOJ 


hUhAW 


71 RQ1 1 9 


Minus 


7QRRRJ?0321 


HUHAjl 


Q">iRnv; 
oziouoo 


Maws 


O l U f / O 1 o&x 


404250 


9187145 


Minus 


3RffQQ^"tR"H 2 37Q2R_3Rfl7S 


404273 


9885189 


Plus 


97789-98285,99601-99855 


404343 


9838093 


Plus 


122664-122931 


404349 


7630858 


Minus 


61006-61187 


404391 


3135305 


Minus 


26030-2$173 l 27852-27997 


404392 


3135305 


Minus 


29738-29857 


404397 


9558608 


Minus 


104042-104232 


404438 


6984205 


Plus 


6341363553 


404439 


7139680 


Plus 


55316-55585 


404530 


6479107 


Plus 


3157-3304 


404596 


9958262 


Minus 


104807-105043 


404649 


9796926 


Minus 


100027-100399 


404666 


7272179 


Minus 


18677-18993 


404687 


9797554 


Minus 


128456-128565 


404744 


9167237 


Rus 


71776-71852,726^5-73019,73700-73822,74692-74850 
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404751 


7630939 


Plus 


113799-114252,114393-114715 


404769 


8099713 


Minus 


175801-176823 


404790 


7230958 


Plus 


38811-38761 


404906 


7331453 


Minus 


100985-101126 


404956 


7387343 


Plus 


55883-56203 


404977 


3738341 


Minus 


43081-43229 


405000 


• 6957544 


Minus 


88854-89993 


405022 


7330304 


Plus 


217163-217439 


405053 


7651944 


Minus 


157134-157430 


405058 


7655685 


Plus 


150740-151556 


405155 


9966228 


Plus 


130469-130723 


405163 


9966267 


Minus 


161171-161299 


405165 


9966302 


Plus 


6461-6845 


405166 


9966302 


Plus 


40526-40891 


405172 


9966752 


Rus 


153027-153262 


405180 


7139743 


Plus 


65438-65740 


405193 


7230072 


Rus 


128187-128383 


405194 


7230072 


Plus 


190465-190645,193346-193610 


405245 


7249293 


Minus 


57560-58312 


405365 


2275192 


Minus 


1 1 9867-1 20372.1 20481 -1 20824, 1 21029-1 21 357 


405376 


1552533 


Rus 


28875-29099 


405418 


6997292 


Plus 


51839-51953 


405436 


7408068 


Minus 


55716-55979 


405443 


7406143 


Pius 


90716-90887,101420-101577 


405474 


8439761 


Rus 


172005-172175 


405502 


9211311 


Minus 


50360-50584 


405580 


4512267 


Rus 


169232-169647 


405595 


7159256 


Rus 


47585-47688 


405630 


4508116 


Minus 


103218-103291,105858-105993,110051-110126 


405646 


4914350 


Rus 


741-969 


405710 


5531256 


Minus 


66203-66832 


405806 


7274891 


Minus 


224961-226780 


405812 


4775630 


Minus 


29424-29764 


405855 


7652031 


Minus 


60377-60795 


405879 


6758747 


Minus 


54789-55457 


405891 


6758795 


Plus 


41062-41861 


405902 


6758795 


Minus 


82322-83110 


405932 


7767812 


Minus 


123525-123713 


40603B 


8389537 


Rus 


37764-37877 


406140 


9168231 


Minus 


49887-50219 


405181 


5923650 


Rus 


165B6-16B55 


406231 


7417725 


Plus 


17206-17641,17772-17968 


406248 


7417725 


Plus 


49711-50227 


406274 


7543787 


Rus 


932-1123 


406356 


7107907 


Plus 


18761-18973 


406404 


9256305 


Minus 


152569-152874 


406467 


9795551 


Rus 


162212-182958 


406557 


7711569 


Minus 


5445-5574,6170-6352 


406575 


7711679 


Rus 


142034-142473 
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TABLE 10A Genes preferentially expressed In non-Invasive bladder tumors 

Pkey: Unique Eos prcbeset identifier number 

ExAccrt Exemplar Accession number, Genbank accession number 

Un'genelDc Unigene number 

Unlgene Title: Unigenegene title 

51 80th percentile of Ta tumor Ais divided by the 80th percentile of T2-T4 tumor A)s 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


421110 


AJ250717 


Hs.1355 


cathepsin E 


8.23 


428551 


AF196478 


Hs.1 88401 


annexin A10 


5.78 


451568 


243948 


Hs.326444 


cartilage acidic protein 1 


5.53 


415511 


A1732617 


Hs.1 82362 


ests 


472 


428336 


AA503115 


Hs.183752 


rnicrosemfnoprotein, beta- 


4.65 


418026 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


4.62 


400752 






NH_003105*:Homo sapiens sorfiBn-retated 


3.99 


430315 


NW.004293 


Hs.239147 


guanine deaminase 


3.82 


403010 






C21000152:gi|6226483lsplQ5211BlYMO3 - ERWS 


3.56 


404977 






InsuHn-iike growth factor 2 (somatomedi 


3.54 


425657 


NM.015865 


Hs.171731 


solute carrier family 14 (urea transport 


3.51 


400409 


AF153341 




Homo sapiens winged hefix/fortad trans 


3.38 


400844 






NM_003105\Homo sapiens sortfllrwelated 


3.27 


406081 






Target Exon 


3.22 


417275 


X6357B 


Hs.295449 


parvalbumin 


3.03 


402230 






Target Exon 


196 


454219 


X75042 


Hs.44313 


v-ref avian reticutoendolheifosis waf 


289 


403381 






ENSP00000231844*:Ecotropic virus Integra 


2.87 


426088 


AF038007 


Hs.166196 


ATPase, aass 1, type 6B, member 1 


186 


452286 


AI358570 


Hs.1 23933 


ESTs, WeaWy sMar to ZN91_HUMAN ZINC 


2.69 


434061 


AW024973 


Hs.283675 


NPD009 protein 


2.66 


418406 


X73501 


Hs.84905 


cytokeratto 20 


2.65 


418818 


AA228899 


Hs.101307 


Homo sapiens HUTU protein mRNA, partial 


159 


421594 


R45689 


Hs.21889 


■ Homo sapiens cDNA FU 12978 fis, dons NT 


2.57 


403383 






Target Exon 


156 


435563 


AF210317 


Hs.95497 


solute carrier family 2 (facilitated gtu 


2.55 


424800 


AL035588 


Hs.1 53203 


MyoO family inhibitor 


2.54 


404606 






Target Exon 


2.53 


416205 


L21715 


Hs.83760 


troponin 1, skeletal, fast 


2.53 


431912 


AI660552 


Hs.76549 


ESTs, Weakly similar to A56154 AM subst 


2.52 


413786 


AW613780 


Hs.13500 


ESTs 


2.51 


421100 


AW351839 


Hs.124660 


Homo sapiens cDMA FU21763 fis. clone C 


2.50 


416640 


BE262478 


Hs.79404 


neuron-specific protein 


150 


420729 


AW964897 


Hs.290825 


ESTs 


2.50 


402844 






C1000118^gi|9951913|rer]NPJ)6283l1| pr 


148 


401093 






C12000586':gi|6330167|dbj|8AA86477.1| (A 


146 


417720 


AA205625 


Hs.208067 


ESTs 


145 


400297 


AI127076 


Hs.306201 


hypothetical protein OKRp5640f 278 


145 


403818 






Target Exon 


2.44 


440273 


AI805392 


Hs.325335 


Homo sapiens cDNA: FU23523 fis, clone L 


144 


418060 


AA211589 


Hs.208047 


ESTs 


2.40 


400843 






NMJ)03105*:Homo sapiens sortilin-reJated 


138 


446006 


NM.004403 


Hs.13530 


deafness, autosomal dominant 5 


2.35 


401512 






NM_014080:Homo sapiens dual oxidase-like 


134 


446847 


T51454 


Hs.82845 


Homo sapiens cDNA FU21930 fis, done H 


132 


417094 


NMJ3QBB95 


Hs.81182 


histamine N-meihyltransf erase 


131 


436293 


A1601188 ' 


Hs.120910 


ESTs 


130 


436246 


AW450963 


Hs.119991 


ESTs 


130 


447578 


AA912347 


Hs.136585 


ESTs, Weakly similar to JC5314 CDC28/cdc 


129 


417381 


AF164142 


Hs.82042 


solute carrier family 23 (nudeobase tra 


128 


426028 


NM.001110 


Hs.172028 


a dislntegrin and metaUoprotelnase doma 


127 


431448 


AL137517 


Hs.306201 


hypothetical protein DKFZp56401278 


126 


437181 


AJ3Q6615 


Hs.125343 


ESTs, Weakly similar to K1AA075B protein 


123 


415025 


AW207091 


Hs.72307 


ESTs 


118 


412610 


X90908 


Hs.74126 


ratty acid binding protein 6, ileal (gas 


104 


424099 


AF071202 


Hs.139336 


ATP-binding cassette, sub-family C (CFTR 


103 


433078 


AW015188 


Hs.121575 


Homo sapiens cDNA FU 12231 fis, done MA 


101 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin 


100 


411880 


AWB72477 




gb:hm30mx1 NCLCGAP_Thy4 Homo sapiens 


1.99 


452316 


AA298484 


Hs.61265 


ESTs, Moderately similar to G786.HUMAN P 


1.89 


413804 


T54682 




gb:yc48b0lr1 Stratagene Bver (937224) 


1.88 


432306 


Y1B207 


Hs.303090 


protein phosphatase 1. regulatory (lnhQ> 


1.76 


405364 






ENSP00000239138*:Guanine nucleofide-bind 


' 1.60 


41432) 


U13616 


Hs.75893 


ankyrin 3, node of Ranvier (ankynh 6) 


1.52 


401929 






C17001690:gl|6005701IreflHP.009099.1| AT 


1.00 



WE 106 
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Pkey: Unique Eos probeset Identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

411880 1263110 1 AW872477 BE088101 T05990 

413804 1390710.1 T64682 BE 168190 BE 168256 

TABLE IOC 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Wenfifier (Gl) numbers. 'Dunham L el al." refers to the publication entitled The DNA 

sequence of human chromosome 22/ Dunham I. et at, Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


400752 


7331445 


Minus 


400843 


9188605 


Plus 


400844 


9168605 


Plus 


401093 


8516137 


Minus 


401512 


7622346 


Plus 


401929 


3810670 


Minus 


402230 


9986312 


Minus 


402844 


9369286 


Plus 


403010 


3132346 


Plus 


403381 


9438267 


Minus 


403383 


9438267 


Minus 


403818 


8962065 


Minus 


404606 


9212936 


Minus 


404977 


3738341 


Minus 


405364 


2281075 


Minus 


406081 


9123881 


Minus 



NLposition 
36215-38461 

586^5970,7653-7784,8892-9023.9673-9807,10634-10789,15254-15403.23827.23958 

24746-24572,25035-25204 

22335-23166 

136399-136557 

3167-3286,42164310 

29782-29932 

54958-553f3 

78385-79052 

26009-26178 

119837-121197 

138360-138512,144656-144796 

22310-23269 

4308143229 

4832548491,4913649252 
38115-38691 
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TABLE 1 1 A: Genes prefarenflalry expressed In muscle-Invasive Madder tumors 



Pkey. Unique Eos probeset Identifier number 
ExAcck Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unigene Title: Unigene gene title 
R1 80th percenffle of T2-T4 tumor Als divided 



Pkey ExAccn UnigenelD Unigene Title 

423961 D13666 Hs.136348 periostin (OSF-2os) 

421948 L42583 Hs.334309 keratin 6A 

401780 NMJX)5557*:Homo sapiens keratin 16{foca 
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 
444381 BE387335 Hs.283713 ESTs, WeaWy similar to S64054 hypotheti 
439926 AW014875 Hs.137007 ESTs 

408243 Y00787 Ks.624 InterteukmS 

414183 AW957446 Hs.301711 ESTs 

411573 AB029000 Hs.70823 WAA1077 protein 

414522 AW518944 Hs.76325 step II splicing factor SLU7 

413063 AL035737 Hs.75184 chitinase 34ike 1 (cartilage glycoprote 

441533 AW958544 Hs. 112242 normal mucosa of esophagus specific 1 

427337 Z46223 Hs.176663 Fc fragment of IgG, low affinity I lib, r 

420859 AW458397 Hs. 100000 S 1 00 calcium-binding protein A8(calgran 

422168 AA586894 Hs. 11 2408 SlOOcaicium-binding protein A7(psorias 

418870 AF147204 Hs.89414 chemokine (C-X-C motiO, receptor 4 (fus 

401781 Target Exon . 

421116 T19132 Hs.101850 retinol-binding protein 1, cellular 

448429 D17408 Hs.21223 calponin 1, basic, smooth muscle 

414020 NM 002984 Hs.75703 small inducible cytokine A4 (homologous 

447526 AL048753 Hs.303649 small inducible cytokine A2 (monocyte ch 

424247 X14008 Hs.234734 tysozyme (renal amyloidosis) 

456525 AW46B397 Hs.1 00000 S 1 00 calcium-binding protein A8(calgran 

418007 M13509 Hs.83169 matrbc metailoproteinase 1 (interstitial 

405663 U24683 Hs.293441 lrnmunogbBoUn heavy constant mu 

425593 AA278921 Hs.1908 proteoglycan 1, secretory granule 

433336 AF017986 Hs.31386 secreted frizzted-related protein 2 

425118 AU076611 Hs.1 54672 methylene tetrahydrofolate dehydrogenase 

41 5994 NM 002923 Hs.78944 regulator of G-protein signalling 2, 24k 

412326 R07566 Hs.73817 small Inducible cytokine A3 (homologous 

422158 L10343 Hs.112341 protease Inhibitor 3, skin-derived (SKAL 

446921 AB012113 Hs.16530 small tnducibte cytokine subfamily A (Cy 
433470 AW960564 transmembrane 4 superfamily member 1 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 

446500 U78093 Hs.1 51 54 sushi-repeat-containing protein, X chrom 

413324 V00571 Hs.75294 corOcotropin releasing hormone 
436729 BE621B07 transmembrane 4 superfamity member 1 

450455 AL1 17424 Hs.25035 chloride intracellular channel 4 

413731 BE243845 Hs.75511 connective tissue growth factor 

412429 AV650262 Hs.75765 GR02 oncogene 

418283 S79895 Hs.83942 cathepsln K (pycnodysostosis) 

418299 AA279530 Hs.83968 integrin, beta 2 (antigen CD18 (p95), ry 

420899 NM.001629 Hs.100194 arachldonate 54ipoxygenase-activating p 

400288 X06256 Hs.149609 Integrin. alpha 5 (fibronectin receptor, 

437446 AA788946 Hs.101302 ESTs, Moderately similar to CA1C RAT COL 

413441 AI929374 Hs.75367 Src-like-adapter 
404854 Target Exon 

431319 AA873350 Hs.302232 ESTs , . 

452432 AW206008 Hs.283378 Hdmo sapiens cDNA FU21778 fis, clone n 

429679 NM_006290 Hs.211600 tumor necrosis factor, alpha-induced pro 

428330 L22524 Hs.2256 matrix metailoproteinase 7 (matrilysin, 

408380 AF 123050 Hs.44532 diubiquitin 

431103 M57399 Hs.44 plelotrophin (heparin binding growth fac 

422545 X02761 Hs.287820 fibronectin 1 

418203 X54942 Hs.83758 COC2B protein kinase 2 

409956 AW103364 HsJ27 inhibin, beta A (eciivin A, acthrin AB a 

406687 M31 126 matrix metailoproteinase 1 1 (stromelysin 

414359 M62194 Hs.75929 cadrterin11.type2.0B^erin(osteob 

417259 AW903838 Hs.81800 chondroltin sulfate proteoglycan 2 (vers 

417497 AW402482 Hs.82212 C053 antigen 

449335 AW1 50717 Hs.345728 STAT induced STAT inhibitor 3 

445033 AV652402 Hs.72901 mucin 13. epithefial transmembrane 

427274 NM 00521 1 Hs.174142 colony stimulating factor 1 receptor, to 

427527 AI809057 Hs.293441 Immunoglobulin heavy constant mu 

409142 AL 136877 Hs.50758 SMC4 (structural maintenance of chromoso 

453331 A1240665 ESTs 

428036 AW068302 Homo sapiens mRNA for caWesmon, 3" UTR 

417366 BE1 85289 Hs.1076 small proline-rich protein IB (comifin) 



R1 

11.22 

10.87 

9.16 

8.40 

7.88 

7.73 

7.54 

7.00 

6.52 

6.42 

6.14 

6.04 

5.66 

5.62 

5.51 

5.48 

5.46 

5.41 

5,41 

5.32 

5.32 

5.27 

5.22 

5.17 

5.08 

4.93 

4.89 

4.74 

4.64 

4.39 

4.30 
4.29 
4.23 
4.22 
4.22 
4.20 
4.18 
4.15 
4.09 
4.00 
4.00 
4.00 
3.97 
3.95 
3.94 
3.91 
3.81 
3.77 
3.76 
3.72 
3.58 
3.58 
3.57 
3.52 
•3.49 
3.48 
3.41 
3.36 
3.32 
3.30 
3.25 
3.23 
3.23 
3.22 
3.18 
3.15 
3.10 
3.09 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



414522 

418478 

417771 

413936 

40S755 

426653 

443623 

443907 

422048 

410204 

438973 

420202 

422626 

442402 

413902 

434858 

407207 

438855 

430413 

424909 

419938 

416819 

422562 

414081 

426406 

443950 

418323 

414420 

415149 

415213 

421848 

452516 

427676 

415198 

424390 

426825 

452363 

407694 



AI752666 

U38945 

AA804698 

AF113676 

N80129 

AA530892 

AA345519 

AU076484 

NH.012445 

AJ243425 

AW959503 

AL036557 

AA344932 

NMJW0954 

AU076743 

R50032 

T03651 

AW946276 

AW842182 

S78187 

AU076772 

077735 

AI962060 

AW969976 

AI742501 

NM.001425 

NM_00211B 

AA043424 

X12451 

NM.002933 

X15880 

AA058630 

AA394062 

AW009480 

AW815657 

AL133415 

AI582743 

U77594 



Hs.76669 

Hs.1 174 

Hs.82547 

Hs.297681 

Hs.94380 

Hs.171695 

Hs.9641 

Hs.9963 

Hs.288126 

Hs.326035 

Hs.60440 

Hs.95910 

Hs.1 18786 

Hs.8272 

Hs.75613 

Hs.159263 

Hs.336780 

Hs.6441 

Hs.241392 

Hs.153752 

Hs.1279 

Hs.60205 

Hs.1 18397 

Hs.279009 

Hs.169756 

Hs.9999 

Hs.1 162 

Hs.76095 

Hs.78056 

Hs.78224 

Hs.108885 

Hs.29759 

Hs.300772 

Hs.943 

Hs.182241 

Hs.297753 

Hs.94953 

Hs.37682 



rucoflnamide N-methyltransferase 
cycfin-dependent kinase Inhibitor 2A (me 
relinoic add receptor responder (tazaro 
serine (or cysteine) proteinase inhibtto 
metaliothlonetn 1L 
dual specificity phosphatase 1 
complement component 1, q subcomponent, 
TYRO protein tyrosine Hnase binding pro 
spondin 2, extracellular matrix protein 
eariy growth response 1 
ESTs, Weakly similar to serin protease w 
putative lymphocyte G0/G1 switch gene 
metallothionein 2A 

prostaglandin 02 synthase (21kD, brain) 

CD36 antigen (collagen type I receptor, 

collagen, type VI, alpha 2 

tubulin, beta polypeptide 

Homo sapiens mRNA; cDNA DKFZp586J021 (fr 

small inducible cytokine A5 (RANTESJ 

cell division cycle 258 

complement component 1, r subcomponent 

pin>2 oncogene 

AE-binding protein 1 

matrix Gta protein 

complement component 1, s subcomponent 
epttheflai membrane protBin 3 
major histocompatibility complex, class 
immediate eariy response 3 



ribonudease, RNase A family, 1 (pancrea 
collagen, type VI, alpha 1 

RNA POLYMERASE I AND TRANSCRIPT RELEASE 

tropomyosin 2 (beta) 

natural killer cell transcript 4 

Interferon Induced transmembrane protein 

vimentin 

Homo sapiens, Similar to complement comp 
retinoic acid receptor responder (lazaro 



3.07 

3.02 

2.77 

£75 

2.75 

2.67 

Z65 

2.64 

2.51 

2.46 

2.46 

144 

144 

2.43 

142 

2.42 

2.30 

2.29 

2.20 

118 

117 

111 

107 

107 

2.03 

101 

1.94 

1.90 

1.72 

1.70 

1.69 

1.64 

1.64 

1.60 

1.59 

1.51 

146 

1.44 



TABLE 11 B_ 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
428036 



CAT Number 
28620.1 



AW793910 AL047737 AV659047 AV659632 A1750389 AA092053 AA092798 H85367 T61597 R23745 220418 T78485 AI751528 AW06BU1 
AA853188 A1752459 AA85371 1 AW950663 R78964 R3S359 R21626 R21522 
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WO 03/003906 



PCTAJS02/21338 



433470 3672.1 



435729 42585.1 



453331 



96214.1 



AW950554 AA092457 T55890 D56120 T92525 AI815987 BE182608 BE182595 AW080238 M90657 AA347236 AW981686 AW176446 
AA304671 AW583735 T61 714 AA31 6968 AI446615 AA343532 AA0B3489 AA48B005 W5209S W39480 N57402 D82538 W25540 W52847 
082729 D58990 BE619182 AA315188AA308636 AA1 12474 W76162 AA088544 H52265 AA301631 H80982AA1 13786 BE620997AW651691 
AA343799 BE613669 BE547180 BE54S556 F11933 AA376800 AW239185 AA376086 8E544387 BE619041 AA452515 AA001806 AA190873 
AA1804B3 AA159546 F00242 AI940609 A1940602 All 89753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 AI828930 
AA102452 A1685095 AI819390 AA557597 AA383220 AI804422 AJ633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 
AA083510AW629109AW513200 M921353AI677934A1148698AI955858M173825M453027W 
A1591384 R79300R80023AA843108AA62H^ 

aS^AI24S30N47475H38178AA366621 AA1 13196 AA1 30023 H39740 T61629 AI885973 AW083671 AA179pOAA305757AJ285455 
N83956 AA216013 AA336155 AW999959 T97525 AA345349 T91762 AA771981 A12B5092 AI591386 BE392486 BE385B52 AA682601 
AI682884 AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW752038 006300 
BE62W^^lSS 

A^OT1625 AI6^M2 C177^ D57559 H61 762 AI720939 AI262930 H27252 AA47971 2 A1927769 AA291 465 
M421678AI925607AA292956M192^^ 

AI424384A1093510AI240988AVV820230AM92554BE044033AW262737AW008^^ 

AA088439 AW806193 AWB06183 AA479834 BE501 957 AA129574 R381 14 AA649494 AA524526 BE327120 AW2531 BK19784 BEN9186 
AW01CT24 AA043217 AW772000 Al 799814 Al 67 1727 AW779725 AA502832 AI470033 AA 129575 W381 61 A1972739 AA4Q4570 AA627686 

AA182640 AI990827 AW275MBAW1 03470 A1298935 AW471421 R791 90 AW0851 58 ^^^q 
AI823466A1692846 M962397AW 

AA037696 AA190966 AA757735 AW772283 AA010631 H80983 AI769516 H64985 AI061065 AI950693 1 AA085492 AU45632 rTO94 
ATO8968 BE156360 AI349390 A1621320 AI738844 AW194272 AA148284 AA953883 C06365 AA487893 AJ92721 ^918523 AI453453 
S2AI18^5W261 359 A1032569 AW338678 AI972899 AI500576 A1872628 A1693030 Z2877 1 AI985583 A1363B29 AW3^301 

AI918451 AA035576 A1472527 AW351556 AA191414 AW674145 D57558 AI446740 D57845 AI589264 C05782 AA722206 AM32033 R21752 

AI14B877 AI798415 D1 1921 A1377596 AI983655 A1744233 C061 1 1 AI248307 AA948565 AI224807 

™^q7R319fl1 H12498 H02668 AA035018 R75957 AI803329 R27528 R36203 AI809932 AI808765 R78948 AA41 1 449 AA976929 AJ378760 



TABLE 11C 



Pkev Unique number corresporwiing to an Eos probeset 
Ref Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers, Dunham I. et al. 

sequence of human chrorrosorne 22.' Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exerts were predicted. 
NLposltion: Indicates nucleotide positions of predicted exons. 



refers to the publication entitled "The DNA 



Pkey Ref 

401780 7249190 

401781 7249190 
404854 7143420 



2 28^2^17,2892^ 
Plus 14260-14537 
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TABLE 12A: Genes preferentially expressed in muscte-Invasive Madder turners 

Ptey: Unique Eos probeset idenlifer number 

ExAocn: Exemplar Accession number, Genbank accession number 

Un&enelD: Unigene number 

Unigene Title: Unigene gene title 

Seq iD No: Sequence Identification Number finking Information in Table 1 2A to sequences tn Table 1 3 



Pkey 


ExAocn 


UnigensID 


Unigene Title 


Sep 10 Na 






Up i/OCftQ 

ns.i4yoU3 


integrtn, alpha 5 (fibronectin receptor, 


Seq ID No. 1 & 2 




YfT7ft9fi 
AU/olU 


nS.Z/oo 


matrix meiaDoprotemase 10(5tromelystfi 


Seq ID No. 3 & 4 




Ml jouy 


Up 0*51 CO 

MS.oJlby 


matrix metaHoprotetnase 1 (interstitial 


Seq ID No. 5 & 6 


f 10/00 


AUnttOft'i'i 


lie ££0O 

nS.oooV 


solute carrier fsmSy 7, (caHonic amnio 


SeqfONo. 7&8 








rbcNtb predicted novel secreted protein 


Seq ID No. 9 & 10 


HOC! 0 


Aruoouoo 


ns.lscoiz 


G protein-coupled receptor 51 


Seq ID No. 11 & 12 


428227 


AA0^104y 


Ue OOitfl 


small inducible cytokine subfamily B (Cy 


Seq ID No. 13 & 14 




AWZ400UO 


y_ 070717 


bo i s, nomoiogue oi r bM-o [uona savignyi 


Seq ID No. 15 & 16 


4«Z4r 


NMJJU034U 


HS. 155324 


matrix metaHoproteinase 11 (stromelysln 


Seq ID No. 17 & 18 




UOlOrO 


11 _ 071.400 

nS.£/<!4yy 


short-chain alcohol dehydrogenase family 


Seq ID No. 19 & 20 


HIV l OS 






nomo sapiens cuna; rLJ2iZ45 ns, aons C 


Seq ID No. 21 & 22 


41D01 1 


A1732617 


Hs. 182362 


ESTs 


Seq ID No. 23-25 


AC') ACt 




Ue inoinc 


transcription factor 


Seq ID No. 26 & 27 


AiVitA 


V00571 


Hs.75294 


corticotropin releasing hormone 


Seq ID No. 28 & 29 


a Ana 


A H 90*900 

Anzoooa 


Hs. 143655 


ESTs 


Seq ID No. 30 




AYVUI40/0 


Ue 1 v?nm 
nS.l J/UUf 


bo IS 


Seq ID No. 31 & 32 


<W£Z£l 






gD:anuocuo.xi buatagene scnizo Drain oi 


Seq ID No. 33 


infto/uj 
lUoyuo 




nS.idUoiZ 


serineAhreonine kinase 15 


Seq ID No. 34 & 35 


4401/1 




Ue onin 


IUNUU 


Seq ID No. 35 & 37 




WWW/ f 


Ue CTT70 


CCTe 
bo IS 


Seq ID No. 38 & 39 


410553 


AW0 16824 


Hs.272058 


hypothetical protein MGC14128 


Seq ID No. 40 & 41 


419183 


i icnccn 


HS.896&3 


cytochrome P450, subfamily XXIV {vitamin 


Seq ID No. 42 & 43 


425721 


AC002115 


Hs. 159309 


uropiakin 1A 


Seq ID No. 44 & 45 


420370 


Y13545 


Hs.97234 


uroptakm 2 


Seq ID No. 46 & 47 


437852 


BE001B36 


Hs.256897 


^ Uf.-LL * ** J 1^CCA41 4 fl ) ^_ 

ESTs, Weakly similar to dJ36501Z1 [H.sa 


Seq ID No. 48 & 49 


428654 


AKOOIdoo 


Hs. 189095 


similar to SALL1 (sal (DrosophilaHike 


Seq ID No. 50 & 51 


456034 


AW45Q979 




gb:Ul-H-Bl3-ala-a-12-0*Ul.si Nd_CGAP„5u 


Seq ID No. 52 


421110 


AJ250717 


Hs.1355 


cathepsln E 


Seq ID No. 53 & 54 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


Seq ID No. 55 - 60 


408243 


Y00787 


Hs.624 


interteukin 8 


Seq ID No. 61 & 62 


440304 


BE159984 


Hs. 125395 


ESTs 


Seq ID No. 63 & 64 


414918 


AI219207 


Hs.72222 


hypothetical protein FU 13459 


Seq ID No. 65 & 66 


418867 


D31771 


Hs.89404 


msh (Oosophila) homeo box homolog 2 


Seq ID No. 67 & 68 


426088 


AF038007 


Hs.166196 


ATPase, Class I, type 86, member 1 


Seq ID No. 69 & 70 


405033 






f*4 AAorf*ot..iPr* j j fio^UaiAn jiTnniF^Hrtc oadit 

C 1 002o52^gi(544327pp{QD4799jFMO5 JwBiT 


Seq ID No. 71 & 72 


422282 


AF019225 


Hs.1 14309 


apoUpoprotein L 


Seq ID No. 73 & 74 


425852 


AK001504 


Hs.1 59651 


death receptor 6, TNF superfarnfly member 


Seq ID No. 75 & 76 


400844 






NM„00310y:Homo sapiens soniiin-raaiBd 


Seq ID No. 77 & 78 


404875 






NM_0228iy:Homo sapiens pnospnoTtpase A2 


Cam in Ma 70 0 OA 

Seq ID No. 79 & 80 


422809 


AK001379 


Hs.1 21 028 


hypothetical protein FU10549 


C _„ irv »i_ Q4 0 DO 

Seq ID No. 81 & 82 


431347 


All 33461 


Hs.251664 


insulin-tike growth factor 2 (somatomedi 


Seq ID No. 83 & 84 


413804 


T64682 




gb.7c48b02.r1 Stratagene liver (937224) 


Seq ID Na 85 & Bo 


444163 


Al 126098 




FGENESH predicted RNaseH oornain-containl 


Seq ID No. 87 - B9 


444444 


AI149332 


Hs. 14855 


ESTs 


Seq ID No. 90S 91 


Htl ill 




rib. 1 


dCHIIB/UllCUHIIlO WlKOO It 


Sen ID No 92 & 93 


419741 


NM_007019 


Hs.93002 


ubiquffJn carrier protein E2-C 


SeqlDNo.94&95 


420281 


A1623693 


Hs.323494 


Predicted cation efflux pump 


Seq ID No. 96 & 97 


402305 






Cl9000735*:gi|4508027|reflNP_003414.1i z 


Seq ID No. 98 -100 


436608 


AA626980 




down syndrome critical region protein OS 


Seq ID No. 101 & 102 


427239 


BE270447 




ubtquitin carrier protein 


Seq ID No. 103 & 104 


413278 


BE563085 


Hs.833 


interferon-stlmulated protein, 15k0a 


Seq ID No. 105 & 106 


424008 


R02740 


Hs.1 37555 


putative chemokine receptor; GTP-binding 


Seq ID No. 107 & 108 


414907 


X90725 


Hs.77597 


polo (Orosophia)-Iike kinase 


Seq ID No. 109 & 110 


421379 


Y15221 


Hs.103982 


small Inducible cytokine subfamily B (Cy 


SeqlDNo.11U112 


425028 


NtLOOMO 


Hs.172028 


8 {fislntegrin and metafloproteinase doma 


Seq ID No. 113&114 


417079 


U65590 


Hs.81134 


Interteukin 1 receptor antagonist 


SeqID No. 115& 116 


401093 






C 1 2000566*:gi[6330 1 67(dbj(BAA86477. 1 1 (A 


SeqfDNo.117-119 


422247 


U1B244 


Hs.1 13602 


solute carrier family 1 (high affinity a 


SeqID No. 120 & 121 


449722 


BE280074 


Hs.23360 


cydinBI 


Seq ID No. 122 & 123 


423673 


BE0030S4 


Hs.1695 


matrix metaQoprotelnase 12 (macrophage 


Seq ID No. 124 & 125 


418478 


U38945 


Hs.1 174 


cycQn-dependent kinase inhibitor 2A (me 


Seq ID No. 126-133 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


Seq ID No. 134 & 135 


418543 


NWL005329 


Hs.85962 


hyaluronan synthase 3 


Seq ID No. 136 & 137 


418678 


NM-001327 


Hs.167379 


cancerAestls antigen (NY-ESO-1) 


Seq ID No. 138-141 


428182 


BE386042 


Hs.293317 


ESTs, Weakly similar to GGC1_HUMAN G ANT 


Seq ID No. 142 & 143 


427335 


AA448542 


Ks^51677 


G anfigen 78 


SeqID No. 144 & 145 


409420 


215008 


Hs.54451 


laminln, gamma 2 (nlcein (100kD), kaSnl 


Seq ID No. 146 & 147 
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404440 



433091 


Y12642 


Hs.3185 


408380 


AF123050 


Hs.44532 


409893 


AW247090 


Ks.57101 


424905 


NM.002497 


Hs.153704 


438817 


AI023799 




42U7B 


A/683243 


Hs.97258 


431515 


NM.012152 


Hs.258583 


433159 


AB035898 


Hs.150587 


432441 


AW292425 


Hs.163484 


425726 


AF085808 


Hs.159330 


443045 


AJ297436 


Hs.20166 


431846 


BE019924 


Hs.271580 


437044 


AL035864 


Hs.69517 


444381 


BE387335 


Rs.283713 


400303 


AA242758 


Hs.79136 


452747 


BE153855 


Hs.61460 


400297 


AI127076 


Hs.306201 


428484 


AF104032 


Hs.184601 


429211 


AF052693 


Hs.198249 


417389 


BE260964 


Hs.82045 


445537 


AJ245671 


Hs.12844 


423961 


D13666 


Hs.138348 


417433 


BE270266 


Hs.82128 


444781 


NMJH4400 


Hs.11950 


430486 


BE062109 


Hs.241551 


425650 


NM.001944 


Hs.1925 


409103 


AF251237 


Hs.112208 


129404 


A1267700 




403047 






439738 


BE246502 


Hs.9598 



NM_Q21048:Homo sapiens melanoma antigen, Seq 10 No. 148 & 149 

lymphocyte antigen 6 complex, locus D Seq ID No. 150 & 151 

dhiblquifin Seq ID No. 152 & 153 

rranichromosome maintenance deficient (S. Seq fD No. 154 & 155 

MIMA (never in mitosis gene a)-related k Seq ID No. 156 & 157 

ESTs Seq ID No. 158 

ESTs, Modara&ety similar to S29539 ribos Seq ID No. 159 & 160 

EOG-7 (endothelial differentiation, Kys Seq ID No. 1 61 & 1 62 

kinesin-Jike protein 2 Seq ID No. 163 & 184 

ESTs Seq ID No. 165 

urop1a1dn3 Seq ID No. 166 & 167 

prostate stem cell antigen Seq ID No. 168 & 169 ' 

uroplakinIB Seq ID No. 170 & 171 

differentially expressed in Fanconfs an Seq ID No. 172 & 173 

ESTs, Weakly similar to S54054 hypotheti Seq ID No. 174 & 175 

UV-1 protein, estrogen regulated Seq ID No. 176 & 1 77 

Ig superfamiiy receptor LNIR Seq ID No. 178 & 179 

hypothetical protein DKFZp56401 278 Seq ID No. 180 & 1 81 

solute carrier family 7 (cationic amino Seq ID No. 182 & 183 

gap junction protein, beta 5 (connexin 3 Seq ID No. 184 & 1 85 

midtfne (neuriie growffi-promoWrfg (actor Seq )D No. J86 & 187 

EGF-like-domain, multiple 6 Seq ID No. 188 & 169 

periosUn (OSF-2os) Seq ID No. 190 & 191 

5T4 oncofetal trophoblast glycoprotein Seq ID No. 192 & 193 

GPl-anchored metastasis-associated protein homolog Seq ID No. 194 & 195 

chloride channel, calcium activated. family member 2 Seq ID No. 196 & 197 

desmoglein 3 (pemphigus vulgaris antigen) Seq ID No. 198 & 1 99 

XAGE-1 protein Seq ID No. 200 & 201 

ESTs Seq ID No. 202 

NM_005656':Homo sapiens transmembrane protease Seq ED No. 203 & 204 

sema domain, irrimunogtobulin domain |lg), transmem Seq ID No. 205 & 206 



T ABLE iff 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene dusler number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

413804 1390710 1 T64682 BE168190 BE 168256 

427239 27647.1 BE270447 AW409921 BE207288 BE207170 D56355 BE263223 BE408171 BE262243 BE392439 BE292738 BE261776 BE314300 BE267719 

BE268715 BE513B76 BE295291 BE29706B AA210923 BE407519 H51344 BE622905 AW248281 AW250313 T19021 AA3551 15 AA316879 
BE269633 BE621936 AA290724 AA380031 AA993616 BE301331 W79220 H73020 AW797850 BE27461 1 BE314315 BE542407 BE562063 
8E3128S8 BE278817 BE252503 AA426470 M91670 NW.014501 BE264533 AI884863 AI884942 AIB78077 AA829937 A1869333 AW273060 
AA906924 AI571293 AW172642 AA582906 AW674596 AW300936 AW615753 AW615587 AW665495 AJ859296 AA594105 AA9281 10 
AI038859 A1589898 AA828316 AA938955 AA28371 1 AW170665 BE047759 AW732128 Ai0i5067 AW07074B AW248955 AA975490 
AA21 1097 AI826097 AA826730 AA994072 A1367867 W74508 AI086474 AA827720 AI922589 AW000868 W58033 AA991461 BE048131 
M91218 AA876186 AA661759 A1123879 A1089508 AW272915 AI566244 AW245061 AA688299 AW250988 A1953468 AI891 144 AW089131 
A1471577 AA293354 AA464019 BE044549 T29587 AW956171 BE537716 BE26940O AA918328 AI538087 AA969243 AW075033 BE139361 
A1340S69 A12501 28 A1247038 BE138953 AW075177 AI307208 BE049086 AW302327 AI054335 Al 345565 AJ334881 AI252075 A1254494 
AW075006 AW302733 AW302738 AJ054057 AI054217 AI054172 AI054302 AI053722 AJ054060 AI054079 AW075181 AI307473 AI312145 
A1340734 AI334909 AW071374 BE13B502 AW074809AW301901 AI251662 AI307559 BE1 39228 AI254764 AW073049 AI251 264 A1802B37 
AW271B67AI307442AW075100AVW73456AVV072496AI2707 

AW073656 AW072901 A1307493 A1255068 AI251289 AJ252160 AI271496 AI252427 AW073469 AW07 1420 AI270 156 A1252926 Al 252839 
AI252868 AW072520 AW073433 AI340643 AI802854 AI334733 AI583896 AW071 31 1 AI802853 AI28971 1 AI345036 AW07251 3 AI348921 
AI307478AI3(^62BE139315AVW71034AI334886AJ340619AW47CM78BE139260BE13B9B5AWM^^ 

432222 343347 1 A1204995 AW827539 AW969908AW440776 AA528756 

436608 42361 3 AA628980 Al 1 26603 BE504035 

438817 46559^.1 AI023799 AA826307 A16B3094 AI307373 A1870547 AW979007 

444163 593658J A1126098 All 84746 A11 48521 

455034 142696 1 AW450979 AA1 36653 AA1 35656 AW419381 AA984358AA492073BE1 68945 AAB09054AW238038BE01 121 2 BE011 359 BE01 1367 

BE011368 BE011362 BE011215 BE011365 BE011363 



TABLE 12C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham L et at, Nature ( 1 999) 402:489-495. 
Strand: Indicates DNA strand from which axons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand NLposition 

400844 9188605 Plus 24746.2487Z25035-25204 

401093 8516137 Minus 22335-23166 



224 



WO 03/003906 



PCT/US02/21338 



402305 7328724 

404440 7528051 

404875 9801324 

405033 7107731 



Plus 4083241362 

Plus 80430-81581 

Phis 96588-96732,97722-97831 

Minus 142358-142548 



225 



WO 03/003906 



PCT/US02/21338 



Table 13 

Seq ID NOt 1 DNA sequence: 

Nucleic Acid Accession *h NMJ>02205 

Coding sequence; 24.. 3173 



1 11 21 31 41 SI 

| I 1 I I 1 

CAGGACAGGG AAGAGCGGGC GCTATGGGOA GCCGGACGCC AGAGTCCCCT CTCCACGCCG 60 

10 TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 

CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 

GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

15 TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 4 B0 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

20 TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 840 

GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCGAT 960 

25 CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 

CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 

GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

30 GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 

GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 

CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 

ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 

35 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA 1800 

ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCTCCTTGG 1860 

40 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 

45 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAQTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 2340 

ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

50 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 

55 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

60 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GC^CCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC JGAGTCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

65 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 3480 

GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

70 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 
TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GAC*TJGGCA 3840 
CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 
AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 
75 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 
GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 
CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 
AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 4200 

226 



WO 03/003906 



AAAA 



Seq ID NO: 2 Protein sequence t 
Protein Accession «: NPJJ02196.1 

1 ll 21 31 41 51 

MGSRTPESPL HAVQUWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFP 60 

GFSVBFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNFTRILEYA PCRSDPSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL 1NLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDPV AGVPKGNLTY 300 

GYVTIUJGSD IRSLYNFSGE QMASYPGYAV AATDVNGDGl* DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYII* 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

Seq ID NO i 3 DNA sequence t 

Nucleic Acid Accession tt: NM_002425 

Coding sequence: 23.. 1453 



1 11 21 31 41 51 

Iaagaaggta Lggcagtga GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTJGGA 
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACpCACCT 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT ^CTCTTT 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTT^CAA^ 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 

SSSSgI tcccactgga accctgaacc tgaatttcat ttgatttctg cat^tggcc 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 

ttttaaagga AATGAGTTCT GGGCCATCAG aggaaatgag gtacaagcag gttatccaag 
Iggcatccat accctgggtt ttcctccaac cataaggaaa attgatgcag ctgtttctga 
caaggaaaag aagaaaacat acttctttgc agcggacaaa tactggagat ™atgaaaa 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 

gcctaaggtt gatgctgtat tacaggcait tggatttttc tacttcttca gtggatcatc 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 

gSacattgc taggcgagat agggggaaga cagatatggg totttttaat aaatctaata 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 

gaIgaagatg agccttgcag atatctgcat gtgtcatgaa gaatgtttct «amtcttc 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT C^CAAT AGGTGAGAGA 
ATGTATTTTC ATAGATGTGT tattacttcc tcaataaaaa GTTTTATTTT GGGCCTGTTC 
CTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NPJJ02416 



1 11 i 1 i 1 r r 

Lhlaflvll clpvcsaypl sgaakeedsn kdlaqqylek yynlekdvkq ^R^snliv 

WOQGMQKFL GLEVTGKLDT DTLEVMRKPR CGVPDVGHFS SFPGMPKWRK THLTYRIWY 
TPDipRDAVD SAIEKALKVW EEVTPLTFSR LYEGEADIMI SFAVKEHGDF YSFDGPGHSL 
AHAYPPGPGl* YGDIHFDDDE KWTEDASGTN LFLVAAHELG HSLGLFKSAN TEALMYPLYN 
SPTELAQFRL SQDDVNGIQS LYGPPPASTE EPLVPTKSVP SGSEMPAKCD f^SFDMST 

Irgeylffkd ryfwrrshwn PEPEFHLISA fwpslpsyld aayevnsrdt vfifkgnefw 

AIRGNEVQAG YPRGIHTLGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRF DENSQSMEQG 

227 



60 
120 
180 
240 
300 
360 
420 



WO 03/003906 



PCT/US02/21338 



5 

10 

15 

20 

25 

30 

35 

40 

45 



FPRLIADDPP GVEPKVDAVL QAFGFFYFFS GSSQPEFDPN ARMVTHILKS NSWLHC 

Seq ID NO: 5 DNA sequence i 

Nucleic Acid Accession 8: NM_002421 

Coding sequencet 72..14B1 



11 



21 



GGGATATTGO 
CAAAGGCCAQ 
CTCACAGCTT 
ACCTGGAAAA 
GTGGCCCAGT 
GGAAACCAGA 
TGGCTCAGTT 
GGATTGAAAA 
CCTTCCAACT 
CAGACATCAT 
CTGGAGGAAA 
TTGATGAAGA 
CTCATGAACT 
ACCCTAGCTA 
AAGCCATATA 
CGTGTGACAG 
TTAAAGACAG 
TTTCTGTTTT 
GAGATGAAGT 
TACACGGATA 
TCGATGCTGC 
ACTGGAGGTA 
ATGACTTTCC 
ATTTCTTTCA 
TCCAGAAAGC 
AAAACACATG 
CATTTTTAAC 
CATATTTTTT 
ATGGTGGGTA 
GATCTTTTCC 
ATATCCTTTC 
CAAGAACACA 
ACAAAATAAG 



AGTAGCAAGA 
TATGCACAGC 
CCCAGCGACT 
ATACTACAAC 
GGTTGAAAAA 
TGCTGAAACC 
TGTCCTCACT 
TTACACGCCA 
CTGGAGTAAT 
GATATCTTTT 
TCTTGCTCAT 
TGAAAGGTGG 
CGGCCATTCT 
CACCTTCAGT 
TGGACGTTCC 
TAAGCTAACC 
ATTCTACATG 
CTGGCCACAA 
CCGGTTTTTC 
CCCCAAGGAC 
TCTTTCTGAG 
TGATGAATAT 
TGGAATTGGC 
TGGAACAAGA 
TAATAGCTGG 
GTGTGAGTCC 
CTCTAGAGTC 
TACTATTTAG 
CAAAAAGTCA 
AGAGTATGCA 
AAGACAGAAA 
TGTGCAGTCA 
TGTTTTATGT 



GGCTGGGAAG 
TTTCCTCCAC 
CTAGAAACAC 
CTGAAGAATG 
TTGAAGCAAA 
CTGAAGGTGA 
GAGGGGAACC 
GATTTGCCAA 
GTCACACCTC 
GTCAGGGGAG 
GCTTTTCAAC 
ACCAACAATT 
CTTGGACTCT 
GGTGATGTTC 
CAAAATCCTG 
TTTGATGCTA 
CGCACAAATC 
CTGCCAAATG 
AAAGGGAATA 
ATCTACAGCT 
GAAAACACTG 
AAACGATCTA 
CACAAAGTTG 
CAATACAAAT 
TTCAACTGCA 
AAAGAAGGTG 
ACTGATACAC 
AATGTAGCCC 
AGTTTGTGGC 
ACTCTGACGT 
GAGACAGGAG 
CTGGTGTCAC 
TTGGAATAAA 



31 

I 

CCATCACTTA 
TGCTGCTGCT 
AAGAGCAAGA 
ATGGGAGGCA 
TGCAGGAATT 
TGAAGCAGCC 
CTCGCTGGGA 
GAGCAGATGT 
TGACATTCAC 
ATCATCGGGA 
CAGGCCCAGG 
TCAGAGAGTA 
CCCATTCTAC 
AGCTAGCTCA 
TCCAGCCCAT 
TAACTACGAT 
CCTTCTACCC 
GGCTTGAAGC 
AGTACTGGGC 
CCTTTGGCTT 
GAAAAACCTA 
TGGATCCAGG 
ATGCAGTTTT 
TTGATCCTAA 
GGAAAAATTG 
TTTTCCTGAA 
AGAATATAAT 
TTTTTGTACT 
TTATGGATTC 
TGATCCCAGA 
ACATGAGTCT 
CCTGGATAGG 
GTCAACCTTG 



41 
1 

CCTTGCACTG 
GCTGTTCTGG 
TGTGGACTTA 
AGTTGAAAAG 
CTTTGGGCTG 
CAGATGTGGA 
GCAAACACAT 
GGACCATGCC 
CAAGGTCTCT 
CAACTCTCCT 
TATTGGAGGG 
CAACTTACAT 
TGATATCGGG 
GGATGACATT 
CGGCCCACAA 
TCGGGGAGAA 
GGAAGTTGAG 
TGCTTACGAA 
TGTTCAGGGA 
CCCTAGAACT 
CTTCTTTGTT 
TTATCCCAAA 
CATGAAAGAT 
AACGAAGAGA 
AACATTACTA 
GAACTGTCTA 
CTTATTTATA 
GATATAATTT 
ATATAGGCCA 
GAGCAGCTTC 
TTGCCGGAGG 
CAAGGGATAA 
TTTCTACTGT 



51 
I 

AGAAAGAAGA 
GGTGTGGTGT 
GTCCAGAAAT 
CGGAGAAATA 
AAAGTGACTG 
GTGCCTGATG 
CTGACCTACA 
ATTGAGAAAG 
GAGGGTCAAG 
TTTGATGGAC 
GATGCTCATT 
CGTGTTGCGG 
GCTTTGATGT 
GATGGCATCC 
ACCCCAAAAG 
GTGATGTTCT 
CTCAATTTCA 
TTTGCCGACA 
CAGAATGTGC 
GTGAAGCATA 
GCTAACAAAT 
ATGATAGCAC 
GGATTTTTCT 
ATTTTGACTC 
ATTTGAATGG 
TTTTCTCAGT 
CCTCAGTTTG 
AGTTCCACAA 
GAGTTGCAAA 
AGTGACAAAC 
AAAAGCAGCT 
CTCTTCTAAC 
TTT 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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1740 
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1920 



Seq ID NO: 6 Protein sequence i 
Protein Accession ft: NP_002412 



50 



55 



60 



MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

! 

LFWGWSHSF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYliEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 

I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWFNCRKN 



51 

I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAKDFP 



Seq ID NO: 7 DNA sequence: 

Nucleic Acid Accession #: NMJ>14331 

Coding sequence* 1..1506 



60 
120 
180 
240 
300 
360 
420 



65 
70 
75 



I 

ATGGTCAGAA 
AACGGGAGGC 
AAGAGGAAAG 
GGAATCTTCA 
ACCATCTGGA 
GGAACAACTA 
TTACCAGCTT 
GTGATATCCC 
CCTGAACTTG 
AGCATGAGTG 
GCAATTCTGA 
TTTAAAGACG 



11 
I 

AGCCTGTTGT 
TGCCTTCCCT 
TCACTTTACT 
TCTCTCCTAA 
CGGTGTGTGG 
TAAAGAAATC 
TTGTACGAGT 
TGGCATTTGG 
CGATCAAGCT 
TCAGCTGGAG 
TAATTATAGT 
CGTTTTCAGG 



21 
I 

GTCCACCATC 
GGGCAACAAG 
GAGGGGAGTC 
GGGCGTGCTC 
GGTCCTGTCA 
TGGAGGTCAT 
CTGGGTGGAA 
ACGCTACATT 
CATTACAGCT 
CGCCCGGATC 
CCCTGGAGTT 
AAGAGATTCA 



31 
I 

TCCAAAGGAG 
GAGCCACCTG 
TCCATTATCA 
CAGAACACGG 
CTATTTGGAG 
TACACATATA 
CTCCTCATAA 
CTGGAACCAT 
GTGGGCATAA 
CAGATTTTCT 
ATGCAGCTAA 
AGTATTACGC 



41 
I 

GTTACCTGCA 
GGCAGGAGAA 
TTGGCACCAT 
GCAGCGTGGG 
CTTTGTCTTA 
TTTTGGAAGT 
TACGCCCTGC 
TTTTTATTCA 
CTGTAGTGAT 
TAACCTTTTG 
TTAAAGGTCA 
GGTTGCCACT 



51 
i 

GGGAAATGTT 
AGTGCAGCTG 
CATTGGAGCA 
CATGTCTCTG 
TGCTGAATTG 
CTTTGGTCCA 
AGCTACTGCT 
ATGTGAAATC 
GGTCCTAAAT 
CAAGCTCACA 
AACGCAGAAC 
GGCTTTTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



228 



WO 03/003906 



PCT/US02/21338 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



TATGGAATGT 
AACCCTGAAA 
TATGTGCTGA 
AATGCAGTGG 
ATCTTTGTTG 
TTATTCTATG 
CGCAAGCACA 
TTCTCTGGAG 
GGGCTGGCAG 
TTCAAGGTGC 
CTTTCCCTCT 
GTCCCTGCGT 
TCAGAGAAAA 
TT ATGAA CTA 
TTTTTACTTC 
CAGTTATTTT 
AACTCTATGT 
GTCTCTGATA 
TCTCTACAAC 
TATATATGGG 
TTTTCAATTC 
ATTTTACATT 
GCTTTAATGG 
TTAAAGAAGA 
AAAAATCCTT 
TTATCTGTCA 
AGCAAGAGTT 
TACCCCTGAT 
TGAGAGAAAT 
CTACATGCAA 
TGAATTTTGA 
CTTCAGATGA 
AAGAAATGTC 
ATCTAGGCTT 
CTGATAAGAA 
GTTTTGCCAG 
GCACTTTGGG 
CAACATGGAG 
GCTGGTAATC 
GAGGTTGCAG 
CCATCTCCAA 



ATGCATATGC 
AAACCATTCC 
CAAATGTGGC 
CAGTGACCTT 
CCCTCTCCTG 
TTGCGTCTCG 
CTCCTCTACC 
ACCTCGACAG 
TTGCTGGGCT 
CACTGTTCAT 
ATTCGGACCC 
ATTATCTCTT 
TAAOCAGAAC 
ATGGACTTGA 
ATTTTCTGAA 
TATTCATATA 
AGTTATAGAA 
CCTACCTATT 
ATATGTTAGC 
TTTTGTAAAG 
TGAAAAAAAG 
GACATTGCAT 
ATTATACCCA 
GTTTCTAGGG 
GAGAATTTAT 
TTTTTTTTTT 
AGTTTGGTAT 
GAGTCTATCT 
AACCAACAAA 
TGTTAGTAAT 
CAGTTTGTGC 
AACTGTCCAG 
GCTGTAAATA 
TGTCAGTAAT 
GAAAATTGAA 
TATTAGAAAA 
AGGCTGAGGG 
AAACCCCATC 
TCAGCTATTG 
TGAGCCAAGA 
AAAAAAAAAA 



TGGCTGGTTT 
CCTTGCAATA 
CTACTTTACG 
TTCTGAGCGG 
CTTTGGCTCC 
AGAGGGTCAC 
AGCTGTTATT 
TCTTTTGAAT 
GATTTATCTT 
CCCAGCTTTG 
ATTTAGTACA 
TATTATATGG 
ATTACAAATA 
GATCTTGGCA 
AGTCTAGAGA 
TTTTAGCATA 
AGTGAATATG 
GGGGTTAGGA 
ACGGCAAAGA 
ATGGTTTTAC 
CATACATCAT 
TGCTTCCCCT 
GAGCACTTTG 
GCTACTGTTT 
TATGTCAGAT 
TCACATCAGT 
TAAATCCTCA 
AAACATATGC 
GAAGATGTTC 
TCTGAAGTTT 
ATTTTCTTTA 
ATTAATTAGG 
AGATTTACAA 
TTCCACACCT 
ATGAGAATCT 
TACTGTGAGC 
GGTGGATCAC 
TCTACTAAAA 
AGGAGGCTGA 
TTGCACCACT 



TACCTCAACT 
TGTATATCCA 
ACCATTAATG 
CTACTGGGAA 
ATGAACGGTG 
CTTCCAGAAA 
GTTTTGCACC 
TTCCTCAGTT 
CGATACAAAT 
TTTTCCTTCA 
GGGATTGGCT 
GACAAGAAAC 
ATACTGGAAG 
ATCTGCCCAA 
ATTACAACTT 
TTCGAACTAA 
CAGTTATTCT 
GAAAAGACTA 
ACCTTCAAAT 
ACACTACAGA 
GATTATGGCA 
TAGATACCAA 
AACAAAGGTC 
ATGAGACACA 
GlTTTTTCAT 
TTGATCAGGA 
TTAGAACAAC 
ATTTTAAGCC 
AAAATAATAG 
TTTAAATTTA 
TACATTTTAT 
AAAAGGCATA 
CTGATGTTTC 
TAATTATCAT 
GTGGATAAGT 
CGGGCATGGT 
CTGAGGTCGG 
ATACAAAATT 
GGCAGGAGAA 
GTACTCCAGC 



TTGTTACTGA 
TGGCCATTGT 
CTGAGGAGCT 
ATTTCTCATT 
GTGTGTTTGC 
TCCTCTCCAT 
CTTTGACAAT 
TTGCCAGGTG 
GCCCAGATAT 
CATGCCTCTT 
TCGTCATCAC 
CCAGGTGGTT 
TTGTACCAGA 
GGGGAGACAC 
TGGTGATAAA 
TTTCTAAGAA 
ATGAGTOGCA 
GACAATTACT 
TGAAGACTGA 
TGTCTATACT 
AAGAGGAGAG 
TTTAGATAAC 
AGTGGGGATT 
TCCAGGAGTT 
TCATTATCAG 
AAGTGTATAA 
CACCTGTTTC 
TTCAAATTAC 
TCCCATATCT 
TGGCTATTTT 
ATTCTTCTGT 
TATTAACATA 
TAGAAAATTT 
TCAACTTGCA 
GTTTGTGTTC 
GGCTTACATC 
GAGTTCTAGA 
AGCTGGGCAT 
TTGCTTGAAC 
CTGGGTGACA 



AGAAGTAGAA 
CACCATTGGC 
GCTGCTTTCA 
AGCAGTTCCG 
TGTCTCCAGG 
GATTCATGTC 
GATAATGCTC 
GCTTTTTATT 
GCATCGTCCT 
catggttgcc 
TCTGACTGGA 
TAGAATAATG 
AGAAGATAAG 
AAAATAGGGA 
CAAAAGGAGT 
ATTTAGTTAT 
CAATTCTTGA 
ATGTGGTCAT 
GATTTTTCTG 
GTGAAAAGTG 
AAAGAAATrr 
AAACACTCAT 
GTTGAATACA 
ATGTTTAAGT 
GAAGTTTTAG 
CACATCTTAG 
ACTAATAACT 
ATTATCAACA 
GTAATCATAT 
TACACGATGA 
TAAAATATCT 
AAAATTGCAA 
CCACTTCTAT 
AAAGAGACAA 
AGAAGATGTT 
TGTAATCCCA 
CCAGCCTGAC 
GGTGGCACAT 
CCGGGAGGCG 
AAGTCAGACT 



780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



Seq ID NO: 8 Protein sequence; 
protein Accession ftj NP_055146 



MVRKPWSTI 
GIPISPKGVL 
LPAFVRVWVE 
SMSVSWSARI 
YGMYAYAGWF 
NAVAVTFSER 
RKHTPLPAVI 
PKVPLPIPAL 
SEKITRTLQI 



11 
I 

SKGGYLQGNV 
QNTGSVGMSL 
LLIIRPAATA 
QIFLTFCKLT 
YLNFVTEEVE 
LLGNFSLAVP 
VLHPLTMIML 
FSFTCLFMVA 
ILBWPEEDK 



21. 
I 

NGRLPSLGNK 
TIWTVCGVLS 
VISLAFGRYI 
AILIIIVPGV 
NPEKTIPLAI 
IFVALSCFGS 
FSGDLDSLLN 
LSLYSDPFST 

h 



31 

1 

EPPGQEKVQL 
LFGALSYAEL 
LEPFFIQCEI 
MQLIKGQTQM 
CISMAIVTIG 
MNGGVFAVSR 
FLSFARWLFI 
GIGFVITLTG 



41 
1 

KRKVTLLRGV 
GTTIKKSGGH 
PELAIKLITA 
FKDAFSGRDS 
YVLTNVAYFT 
LFYVASREGH 
GLAVAGLIYL 
VPAYYLFIIW 



51 
I 

SIIIGTIIGA 
YTYILEVFGP 
VGITWMVLN 
SITRLPLAFY 
TXNAEELXiLS 
LPEILSMIHV 
RYKCPDMHRP 
DKKPRWFRIM 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 9 DNA sequence 
nucleic Acid Accession ft: 
Coding sequence : 1 . . 546 



ATGGCCTTGG 
TTCATGGCTG 
TTGGTGCAAA 
CAGCTCGCCG 
GGCTTTGACT 
TTCTGGTCCT 
CATTCTCCTC 
CAGTTTGTCA 
AAACCCTTTG 
AAGTAG 



11 
I 

GCAGCTCCGC 
GCATTAAGTG 
GGCTCCTGGG 
GTGCCCTCGA 
CCGTGTTGGC 
CAGGAGACAT 
TGAGCACTCC 
AAGGCCAGAA 
AAACTGGTTC 



21 
I 

CCCTGTGGCT 
TCTGTGGCTT 
TGGAGCTCGA 
CCTGCCCGCT 
CTCTCTGCCG 
GTCTGACTGG 
AGGGTGGAGC 
CTTGGACGTA 
CATGGTTCCA 



31 

s 

TTGCAGGGTA 
TTCCAGGTAG 
ACTGAAACTC 
GGGTCCTGTG 
TGGATTTTAA 
GACTACTGGG 
AGGCAGGTTA 
ACAGTGTACT 
TTCACCTTCA 



41 
I 

ATGCCCACTT 
TCCCCCTGGG 
GCTTTGTGCC 
CCTTTGAAGA 
ATGAGGAAGG 
TTGGCTGGCG 
GGCTCCAGTT 
GCAGGCTCCA 
TGTACTGGAT 



51 

I 

CCCTGCTGCT 
GCTCCCCGAG 
CGCAGCCCTG 
GAGCACTTGC 
CCAGCAACCT 
GAAGTTAATT 
GTTCCAGCTT 
GGGCAGTGAG 
CCACCATGGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 



Seq ID NO: 10 Protein Sequence 
Protein Accession B« FGENESH 



229 



WO 03/003906 



1 11 21 31 41 SI 

1 I J I I 1 

MALGSSAPVA LQGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRLLGGAR TETRFVPAAL 60 

QLAGALDLPA GSCAFEESTC GFDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 120 

HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSE KPFETGSMVP FTFMYWIHHG 180 

K 



Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession th EOS sequence 

Coding sequence: 461-3286 



11 



GCCGCCCCTG AGCAGCCTCG 
CCGAGCTCCA GGGGCTGCCG 
GGGGGCGGGC GGAGGCGGCC 
ACGCCGGCCC CCCTCTCCGC 
CCCACCCCTT CCCGGCGTGA 
GCGGCCGTTC TGAGCCGAGC 
GCCATTGCGC GGGCGCCGCG 
GCGGGCCGAG TGAGCCGGCG 
CGGGCAGCCC GGGCCGCCGC 
GCTGCTGCCG CTGCTGCTGC 
CCGGCCGCCG CCCAGCAGCC 
GGTGGCCAAG GGCAGCATCG 
GATCCGCAAC GAGTCACTCC 
GTGCGACAAC GCAAAAGGGT 
CTTGATGGTG TTTGGAGGCG 
AGGCTGGAAT CTGGTGCAGC 
AAAATACCCT TATTTCTTTC 
GAAGTTGCTC AAGCACTACC 
GTTCTCTGAG GTGCGGAATG 
AGACACCGAG AGCTTCTCCA 
TGTGCGGATC ATCCTTGGCC 
ATACGAGGAG AACATGTATG 
TTCTTGGTGG GAGCAGGTGC 
GCTTGCTGCC ATGGAGGGCT 
CAAGACCATC TCAGGAAAGA 
AGGCGTGGGG CCCAGCAAGT 
GACACTGCAG AGGGCCATGG 
CTTCAACTAC ACGGACCACA 
CTTCTTCGGG GTCACGGGTC 
ATTTACTCAA TTTCAAGACA 
CACACTGGAG ATCATCAATG 
GACCATCATC CTGGAGCAGC 
CCTCACCATC CTCGGGATGA 
GAATCAGAAG CTCATAAAGA 
GATGCTCTCC TATGCTTCCA 
GACCTTTGAA ACACTTTGCA 
TTTTGGGGCC ATGTTTGCAA 
GAAGAAGAAG ATCATCAAGG 
CGACCTGTGT ATCCTGATCT 
GTACAGCATG GAGCCGGACC 
CTGTGAGAAC ACCCATATGA 
CATGTTGTTC GGTTGTTTCT 
CGACAGCAAG TACATCGGGA 
CGCTGTCTCC TTCCTGACCC 
CATCATCTTC TGCAGCACCA 
GAGAACAAAC CCAGATGCAG 
GAAAGAAGAT TCTAAAACGT 
CCTGGAGGGC CTACAGTCAG 
AGACTTGGAA GAGGTCACCA 
ACAGAACCAC TACCAAGAGC 
AGATGGAGGA AAGGCCATTT 
CACAACAGAG CCCTCTCGAA 
CATCCAGCGT CGGCTGTCCC 
CGGAGGCGTG GACGCCAGCT 
CAGACATGTG CCACCCTCCT 
GGCCCGGGGC CTCCCCCGTG 
TGTCGGCTCT GGCTGCGGAG 
GGCACTCAGG TGGACAGGAC 
TGTGAAGTCC TTATTTCTTC 
CTGCAAACAA GGAGGCGCTG 
CAAAAAAACT AGACAAGGAG 
GAGCAGCCTT GGGAAGAGGC 
GGCTGTGACC CCTCAGGCTA 
AGGCCCCCAG CTGCCAGGCC 



21 31 41 SI 

III! 

CCTTCGCCTC CCGCGTTTCC TGCCGTCCGC CCTCCCCCGG 
CCTAGCAGCT CCCGGCGGGA GAGCGGTTCA GAGCGCGCAC 
CGGTGCGGGG CGGCCGCGCT GGAGAGAGGC GCGCGCGGAG 
GTTTGCTCGC TTGCTCCCCG CCTCCCGCAC TCCGCTCGCT 
TTGATCCGTC ACGGGCGCCG CCGCTGCCGC CGCCGCCGCC 
CGGAACCCTA GCCCGAGACG GAGCCGGGGC CCGGGCCGGC 
GGAAGACCTT GGCGCGGGGC GGCGGGCCGG GCCAGGCCAT 
CCCGCAGCCC GCGGCGCGGC ATGGCTTCCC CGCGGAGCTC 
CGCCGCCGCC ACCGCCGCCC GCGCGCCTGC TACTGCTACT 
CTCTGGCGCC CGGGGCCTGG GGCTGGGCGC GGGGCGCCCC 
CGCCGCTCTC CATCATGGGC CTCATGCCGC TCACCAAGGA 
GGCGCGGTGT GCTCCCCGCC GTGGAACTGG CCATCGAGCA 
TGCGCCCCTA CTTCCTCGAC CTGCGGCTCT ATGACACGGA 
TGAAAGCCTT CTACGATGCA ATAAAATACG GGCCTAACCA 
TCTGTCCATC CGTCACATCC ATCATTGCAG AGTCCCTCCA 
TTTCTTTTGC TGCAACCACG CCTGTTCTAG CCGATAAGAA 
GGACCGTCCC ATCAGACAAT GCGGTGAATC CAGCCATTCT 
AGTGGAAGCG CGTGGGCACG CTGACGCAAG ACGTTCAGAG 
ACCTGACTGG AGTTCTGTAT GGCGAGGACA TTGAGATTTC 
ACGATCCCTG TACCAGTGTC AAAAAGCTGA AGGGGAATGA 
AGTTTGACCA GAATATGGCA GCAAAAGTGT TCTGTTGTGC 
GTAGTAAATA TCAGTGGATC ATTCCGGGCT GGTACGAGCC 
ACACGGAAGC CAACTCATCC CGCTGCCTCC GGAAGAATCT 
ACATTGGCGT GGATTTCGAG CCCCTGAGCT CCAAGCAGAT 
CTCCACAGCA GTATGAGAGA GAGTACAACA ACAAGCGGTC 
TCCACGGGTA CGCCTACGAT GGCATCTGGG TCATCGCCAA 
AGACACTGCA TGCCAGCAGC CGGCACCAGC GGATCCAGGA 
CGCTGGGCAG GATCATCCTC AATGCCATGA ACGAGACCAA 
AAGTTGTATT CCGGAATGGG GAGAGAATGG GGACCATTAA 
GCAGGGAGGT GAAGGTGGGA GAGTACAACG CTGTGGCCGA 
ACACCATCAG GTTCCAAGGA TCCGAACCAC CAAAAGACAA 
TGCGGAAGAT CTCCCTACCT CTCTACAGCA TCCTCTCTGC 
TCATGGCCAG TGCTTTTCTC TTCTTCAACA TCAAGAACCG 
TGTCGAGTCC ATACATGAAC AACCTTATCA TCCTTGGAGG 
TATTTCTCTT TGGCCTTGAT GGATCCTTTG TCTCTGAAAA 
CCGTCAGGAC CTGGATTCTC ACCGTGGGCT ACACGACCGC 
AGACCTGGAG AGTCCACGCC ATCTTCAAAA ATGTGAAAAT 
ACCAGAAACT GCTTGTGATC GTGGGGGGCA TGCTGCTGAT 
GCTGGCAGGC TGTGGACCCC CTGCGAAGGA CAGTGGAGAA 
CAGCAGGACG GGATATCTCC ATCCGCCCTC TCCTGGAGCA 
CCATCTGGCT TGGCATCGTC TATGCCTACA AGGGACTTCT 
TAGCTTGGGA GACCCGCAAC GTCAGCATCC CCGCACTCAA 
TGAGTGTCTA CAACGTGGGG ATCATGTGCA TCATCGGGGC 
GGGACCAGCC CAATGTGCAG TTCTGCATCG TGGCTCTGGT 
TCACCCTCTG CCTGGTATTC GTGCCGAAGC TCATCACCCT 
CAACGCAGAA CAGGCGATTC CAGTTCACTC AGAATCAGAA 
CCACCTCGGT CACCAGTGTG AACCAAGCCA GCACATCCCG 
AAAACCATCG CCTGCGAATG AAGATCACAG AGCTGGATAA 
TGCAGCTGCA GGACACACCA GAAAAGACCA CCTACATTAA 
TCAATGACAT CCTCAACCTG GGAAACTTCA CTGAGAGCAC 
TAAAAAATCA CCTCGATCAA AATCCCCAGC TACAGTGGAA 
CATGCAAAGA TCCTATAGAA GATATAAACT CTCCAGAACA 
TCCAGCTCCC CATCCTCCAC CACGCCTACC TCCCATCCAT 
GTGTCAGCCC CTGQGTCAGC CCCACCGCCA GCCCCCGCCA 
TCCGAGTCAT GGTCTCGGGC CTGTAAGGGT GGGAGGCCTG 
ACAGAACCAC ACTGGGCAGA GGGGTCTGCT GCAGAAACAC 
AAGCTGGGCA CCATGGCTGG CCTCTCAGGA CCACTCGGAT 
GGGGCAGGGG GAGACTTGGC ACCTGACCTC GAGCCTTATT 
ACAAAGAAGA GGAACGGAAA TGGGACGTCT TCCTTAACAT 
GGATATCAAA CTTGCAAAAA AAAAAAAAAA AAAAAAAAAA 
AGAGGCACTA GAACTCCAGC TGGAAGTCAC GGAGTGGCTC 
AAGGAGCTTC TGAAGAAACT GCCTCTGCAC ACACATCACT 
GCCCTTCTCC ACTCTGGGGG AGGAGGTGGG AAGGGCCACC 
AGCTGACCCC AGCCTTCCTG GAACAGGGAG TCTGCAGGAG 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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1380 
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1500 
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1620 
1680 
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1800 
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2400 
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2760 
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2880 
2940 
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3360 
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3540 
3600 
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3780 
3840 
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CGCAGACAGG 
CTTACATGCT 
CCACACGTTA 
CAGACACGCT 
CTTGTGTTCC 
GTCTGTTCCC 
CCAAGGGCTG 
GAATTGGAGG 
GAGGGCTTCC 
CCTTGQGATC 
CCTTTAAATT 
CTCAGTCCCC 
TCAGTTTCCC 
GTCACTCTGA 
CGCCCTGATC 
TGTGGCCCAG 
CTCCCAGTGA 
ATGACAATGT 
TTGGCAGCTT 
CTGTTTTCTC 
TCAAAGCACC 
GGAGGTACTG 
CATTCTCTGT 
CTTGCTTTCT 
AGGGAAAGAG 
GAAATATTAT 
CTGTGCGTCT 
AAGTACATTC 
AAATAACAAG 
GCAAACACAC 
GTGCACAAAA 
CTCGTACCCC 
ATAAAAGGAA 



CACAGCCCTG 
CCACATGGGA 
ACACAATAAC 
CTGCAGCATT 
TATCGGATGC 
AGCAAACACG 
GCTGGGATTT 
CACTCTGCTA 
CTTGGCTTCC 
ACTGGGGTGA 
GAGCCCCTAA 
TGCCCACTCC 
CTCTGTAACT 
GGATTGAAAC 
OGTGAGGTAT 
AGAAGCTTCC 
CTTTTCTGAC 
CCAGGATGTC 
CAACAGGCAG 
CACCAGGCTT 
CTATTTCACG 
TAGCTTTTAG 
TTAGGCACTA 
TGATTTCACC 
ACAATTAATT 
TCCAGCATAG 
GTAACCCTTT 
TCTGTCTTAC 
ACCCACAGAA 
OGTCCATTTC 
CGCTTTAAAT 
TACTGCATGG 
AAAACCTGTC 



GAGCAGGCAG 
CCTGTGTCAC 
ACCCATTCCT 
CACCGACAGT 
TCCCAGTAAC 
ATAAAGAAAG 
TTTTTTTTTT 
AAAGACATCC 
AGAAGGCAGC 
TCAGTCAGCA 
GAGCACTGGC 
CTACGTGACT 
TGGGGTTGAA 
TTGATCTTGT 
GAGTATGACT 
AGCACAGCCC 
ATTCCTAGAC 
TGCACAACTG 
GCAATAATCT 
TGACAAAAGG 
TAGCATAGCT 
GGAAAAAAAA 
AAAATCGGTG 
AAAACCAAAT 
GTACATAATG 
CAGGAAAACA 
GTGA'CTCCTG 
TGATACTGTA 
GTGACTCTAG 
TGAAGAGGCT 
TAGACTGGAA 
TATCGATTTT 
GTCTTT 



GCCGAGGGCT 
CCAATGAGAT 
GGGACCGTGG 
CTGTCATGCA 
CAGGGGGACC 
ATTGTGCACT 
TTTCCCACTA 
CCGTAGACAT 
CTTCCATCCA 
GATTGATTCT 
CTGGGAGTCA 
TTGATCAGGT 
CTAAAACACC 
CCAGGAAAGC 
CTGACCTTCA 
TCTACCCAGA 
AGGTTCATTC 
AGCAGCTCGC 
CTTCCCAGAA 
GCCCACAGGA 
TTCTGCTCCC 
ATGTTAACAC 
TTGTCACTCA 
TTAATTTAAA 
TATACACACA 
AACAAAAGTA 
AGCGTGCGCT 
GGTTCACCCA 
CTACTTAATG 
TCGGCCTGAA 
CTGCCAGAAT 
TAATAAATTG 



GCGGCACTGG 
GTTTGTTACT 
GGATTTAGGG 
CCCACCACGT 
ACCCGAGCTA 
TTAACCTCTC 
ACTTTGTTTC 
AGGGGAGAGA 
GACAAGCCAG 
CATTCATAAG 
GACAGACCTG 
CACTAGTGTC 
TGTCCTGCCT 
TTTGTACCAA 
GCCCTCCCTA 
ACATCCGGGC 
TTTGCTCAAG 
TCACTCCCTA 
CCACTGCAGT 
ATCTTACCAA 
CTTCCCCAAA 
ATCACAGGTC 
CTGTGTATTA 
GGACCACATT 
CACAAAAAAA 
TTGGACTGTC 
GTCTTCTAGG 
TTTTTTTTTA 
GTTCTGTTCT 
GGCATTTTCC 
CAAATGTAAA 
TTGCAAATTT 



AGCAGGCTGA 
CTGGTAAATG 
CACGTCACTG 
TGGCCATGTC 
ATCATGGAAT 
TCATCAGGGC 
TGACCAAAGT 
GTTGCTGGCT 
TGAGCTCTCC 
ATCATTCCTC 
GGTTCAAGTC 
TCTCTGAGCC 
ACCTCACAAG 
ACAGTGAAGC 
CAGCCGGGGG 
TGGAGGGAGG 
AAAGGCCTGA 
AAGAAACCTA 
CAGGAATAAA 
TGCCAACATT 
GAGAGGTTAT 
AAGTTGAAGT 
CCAGTATTTA 
AATTTTTCAA 
AATACCTGTA 
GGAGGTGAGC 
TTAACTCACG 
ATTTCCTCGC 
TTTATATGCA 
AATGATGTTA 
TGAGGAATTT 
GTTTTTATGA 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



Seq ID NOi 12 Protein sequence: 
Protein Accession ft: EOS sequence 



1 
I 

MASPRSSGQP 
LMPLTKEVAK 
IKYGPNHLMV 
AVNPAILKLL 
KKLKJGNDVRI 
RCLRKNLLAA 
GIWVIAKTLQ 
ERMGTIKPTQ 
LYSILSALTI 
GSFVSEKTFB 
VGGMLLIDLC 
YAYKGLLMLF 
FCIVALVIIF 
NQASTSRLEG 
GNFTESTDGG 
HAYLPSIGGV 



11 

I 

GPPPPPPPPP 
GSIGRGVLPA 
FGGVCPSVTS 
KHYQWKRVGT 
ILGQFDQNMA 
MEGYIGVDFE 
RAMETLHASS 
FQDSREVKVG 
LGMIMASAFL 
TLCTVRTWIL 
ILICWQAVDP 
GCFLAWETRU 
CSTITLCLVF 
LQSENHRLRM 
KAILKNHLDQ 
DASCVSPCVS 



21 
I 

ARLLLLLLLP 
VELAIEQIRN 
IIAESLQGWN 
LTQDVQRFSE 
AKVFCCAYEE 
PLSSKQIKTI 
RHQRIQDFNY 
EYNAVADTLE 
FFNIKNRNQK 
TVGYTTAFGA 
LRRTVEKYSM 
VSIPALNDSK 
VPKLITLRTN 
KITELDKDLE 
NPQLQWNTTE 
PTASPRHRHV 



31 
I 

LIiLPLAPGAW 
ESLLRPYFLD 
LVQLSFAATT 
VRNDLTGVLY 
NMYGSKYQWI 
SGKTPQQYER 
TDHTLGRIIL 
I INDTIRFQG 
LIKMSSPYMN 
MFAKTWRVHA 
EPDPAGRDIS 
YIGMSVYNVG 
PDAATQNRRF 
EVTMQLQDTP 
PSRTCKDPIE 
PPSFRVMVSG 



41 
I 

GWARGAPRPP 
LRLYDTECDN 
PVLADKKKYP 
GEDIEISDTE 
IPGWYEPSWW 
EYNNKRSGVG 
NAMNETNFFG 
SEPPKDKTII 
NLIILGGMLS 
IFKNVKMKKK 
IRPLLEHCEN 
IMCIIGAAVS 
QPTQNQKKED 
EKTTYIKQNH 
DINSPEHIQR 
L 



51 
I 

PSSPPLSIMG 
AKGLKAFYDA 
YFFRTVPSDN 
SFSNDPCTSV 
EQVHTEANSS 
PSKFHGYAYD 
VTGQWFRNG 
LEQLRKISLP 
YASIFLFGLD 
IIKDQKLIiVI 
THMTIWLGIV 
FLTRDQPNVQ 
SKTSTSVTSV 
YQELNDILNL 
RLSLQLPILH 



Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_001565. 

Coding sequence: 67-363 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



I 

GAGACATTCC 
AGCACCATGA 
ATTCAAGGAG 
CCTGTTAATC 
CGTGTTGAGA 
TCGAAGGCCA 
TAAAACCAGA 
CCTCTCCCAT 
GTTACACTAA 
GGTTAATGTT 
GCTCTACTGA 
ACCTTTCCCA 
TCAGAATCTC 
ACTTCATGGA 
CATACAATTC 
CTTATTTAAT 



11 
I 

TCAATTGCTT 
ATCAAACTGC 
TACCTCTCTC 
CAAGGTCTTT 
TCATTGCTAC 
TCAAGAATTT 
GGGGAGCAAA 
CACTTCCCTA 
AAGGTGACCA 
CATCATCCTA 
GGTGCTATGT 
TCTTCCAAGG 
AAATAACTAA 
CTTCCACTGC 
CAAACACATA 
GAAAGACTGT 



21 
1 

AG A CAT ATT C 
GATTCTGATT 
TAGAACCGTA 
AGAAAAACTT 
AATGAAAAAG 
ACTGAAAGCA 
ATCGATGCAG 
CATGGAGTAT 
ATGATGGTCA 
AGCTATTCAG 
TCTTAGTGGA 
GTACTAAGGA 
AAGGTATGCA 
CATCCTCCCA 
CAGGAAGGTA 
ACAAAGTATA 



31 

1 

TGAGCCTACA 
TGCTGCCTTA 
CGCTGTACCT 
GAAATTATTC 
AAGGGTGAGA 
GTTAGCAAGG 
TGCTTCCAAG 
ATGTCAAGCC 
CCAAATCAGC 
TAATAACTCT 
TGTTCTGACC 
ATCTTTCTGC 
ATCAAATCTG 
AGGGGCCCAA 
GAAATATCTG 
AGTCTTAGAT 



41 



GCAGAGGAAC 
TCTTTCTGAC 
GCATCAGCAT 
CTGCAAGCCA 
AGAGATGTCT 
AAATGTCTAA 
GATGGACCAC 
ATAATTGTTC 
TGCTACTACT 
ACCCTGGCAC 
CTGCTTCAAA 
TTTGGGGTTT 

ctttttaaag 

ATTCTTTCAQ 
AAAATGTATG 
GTATATATTT 



51 
I 

CTCCAGTCTC 
TCTAAGTGGC 
TAGTAATCAA 
ATTTTGTCCA 
GAATCCAGAA 
AAGATCTCCT 
ACAGAGGCTG 
TTAGTTTGCA 
CCTGTAGGAA 
TATAATGTAA 
TATTTCCCTC 
ATCAGAATTC 
AATGCTCTTT 
TGGCTACCTA 
TGTAAGTATT 
CCTATATTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



231 



WO 03/003906 



TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 



Seq ID NO: 14 Protein sequence: 
Protein Accession ft: NP_001555.1 

1 ll 21 31 41 51 

| | I I I I 

MNQTAILICC LIFIiTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 



60 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 241.. 1272 



11 

! 



21 



GCGCAGACGC 
CTCTACAAAG 
GTGCCCACCT 
TTGAGGGCCA 
ATGGTGACAG 
GAGCACTTCT 
CCTGCTCTGC 
GTGGTGGGCC 
ATCACACCAA 
GAGCGTGCGC 
TACAACAATG 
TCCGACGCCT 
AGCCTGGGCT 
GAGCAGGGCG 
CAGGATGTGT 
GCCACGCCCA 
GCCCGCGCTG 
GCCGGACTCC 
GGCGGCCTGC 
ACTGCCGCCC 
TGCGAGAGGA 
ATATTCTCCT 
ACCTGTTTTC 
TACTCGCTCG 
AGAGCCTCTG 
AGAGTTACAA 
TTGTTTTGCC 
TGTGTCACTC 
ACCTGTATCT 
CTAACATGAC 
TTTCCAGCAG 
GGGGAGAGAA 
CAAAGCTGGG 
GTTTCCCTTC 
CTTTCCTTTC 
GTCTTGTTCC 
CTTTGGGGAC 
ATTCCTCTTC 
ATTGATGGAA 
AAGGGTCAAA 
TGGATTTTCT 
AGGAAAGGGC 
GTGCCCGACT 
TTGAATCCTT 
TGGTTGAGGA 
GGAGGTGGGC 
TTGCTGCTTC 
TGGCCGGGCC 
GTGGGACTGT 
ATGGAGCAGG 
CACCTCTCCA 
CAGCCAAGAA 
GATGTCTGTA 
GGAGGGGGGT 
CCCACAAAGG 
AATTGAGCCA 
AGAAAGAGAG 
TAGCAAAAAA 
GTTTATTACC 
ATTAGGTAAT 



CCCAGCCCCC 
AGGCCGAGCT 
CCGAGCACGT 
AGACCAACAC 
GGCGACGGGA 
CCATGATCCG 
CCGGCCAGGT 
CCAAAGGGGC 
GCCGTGACCG 
GCGAGGAGAT 
AAAACGACTT 
GGCGGGTGCA 
GCATCGGCGA 
GGGACTTTGG 
ACTACQGCGT 
CCTCCGTGCT 
GGCCCCCGGG 
CGAGGCGCCC 
GGAGCCCCGG 
TTGTGCCCTG 
CGGACCCAGA 
AAGCCCCGTG 
CACTAAAGCC 
CTGGGGAAGG 
GTCACCCTGT 
CTCTGATACC 
AGAOGTGGCC 
CACCCCTTCT 
AGAGGTGCTC 
CCTTTACCTC 
TGTAAGGGAG 
GCATGTCAAT 
CTAGAGGAAT 
ACTCTCTCCG 
CCTCCAGCTC 
TTGTCTTTGT 
ACCATCCTTC 
AGGCATCTGG 
TGTATCCCCA 
CCCCCACCTT 
GGTTGTGGAA 
GTGAGGCAGG 
CCCTGCACCC 
ATTAGCTTGA 
GAAGCCAGGT 
AGCAGGGCCT 
CCTCCTGCTC 
AAGAGAGGAT 
TCTGAGTGAT 
TGGAGGTGCA 
CAGCCTTCCC 
AACTGCCCTG 
CAATAGAAAC 
GGGAAGAAGG 
CAGAAGACTG 
AACCTAGAAA 
CATGAGAGGG 
AAAAAAAAAA 
ATATAAATGT 
TTATTATATG 



31 
I 



CACCGCCCCC 
GCGCCTGAAG 
GGCCGAGATC 
CTACATCAAG 
GGACGTGGCC 
TGCCTCCCGC 
GACCATCCGT 
AACCATCAAG 
CGACCCCGTG 
CGAGACGCAC 
CCTGGCGGGG 
CCAGCCCGGC 
GTGCGGAGTG 
CTACGGCGGG 
GGCCGAGACT 
CTTCTCCTCT 
CGCACACCGC 
CCCGGGAGAG 
CGGCGGGCGG 
CGGACACAAC 
GTGTCCCGTC 
CCCCATGCCT 
TTTTGGAAAG 
GGGGAGGGAG 
CCTGGAAAGA 
TCAACACACC 
ACTAAGGCAT 
TCCGAGGAGG 
TTTGCAATCC 
CCACCCCACC 
TTGGAGTCCT 
CGTTTCTGTG 
CTGGAGAGGA 
CCTATCTTCC 
TTTGCTTTGC 
CTTTCTTCCC 
TGGGGAGAAG 
AGGCCCTCTC 
TTCTCAGTGA 
CATCACTATG 
GTGGGGGCGC 
AGGAGAGACT 
CTCTCAACCT 
ATCCTCCATG 
CTTCAAAGAG 
ATTCTGAGAA 
ATTTGGGGCT 
GGAGGGATGG 
CACCCGAGCA 
GAGGGGATTT 
CTCCAGAACA 
TGACGACTTC 
CAAACCAAAT 
GATGTCTGTC 
TTACACTAGG 
CAAACACAAA 
AGCGAGACAG 
AGCAGTTCTT 
AGCAAAGAAC 
AAAAGGATAT 



AAAGGGGCGA 
GGCAGCAGCA 
GTGGGCAGGC 
ACACCGGTGA 
ACAGCCCGGC 
AACAAGTCAG 
GTGCGGGTGC 
CGCATCCAGC 
TTCGAGATCA 
ATCGCGGTGC 
AGCCCCGACG 
TGCAAGCCCC 
GACTCTGGCT 
TACCTCTTTC 
AGCCCCCCGC 
GCCTCCTCCT 
TCCCCTGCCA 
CCGCTCCAGG 
GATTGCATGG 
CTGTTCTGCA 
TGCCACATCA 
CCGGGGCCCA 
CGGTGATTTG 
GCAGTGGTGG 
TTGGGAGGGG 
CTTAAATCTG 
TCTGACGCCC 
GGGTGGGTAA 
CTAAGCCCTC 
CCCATATCCT 
ATCAGAAGTT 
GCTGAAAGGC 
GTCCTCCTCT 
CTTCCTTTGG 
TTTCTTTTGG 
TTCCCCCTGC 
TAGGGGGAGG 
CCCCACTCCT 
AAATGTGAGG 
GGCATTATAT 
CAGAGTAGTG 
GGTGGAGGGA 
GACCTGAACC 
CAAATCATGG 
GGGTCAGCCT 
TCACATATTG 
GCCACCAGCT 
GAGTCCCAGG 
CTTAAAGCTC 
CCTCCTCTCC 
CCAGCCAGGG 
CCTCCTTCCC 
GGGCACCCTC 
TGTCGTCCCC 
GGGCTCAGCA 
ACACGAATAG 
GCGACCAACA 
TATAATTTAA 
CTGGGCTAAT 
TTTATTTTAT 



41 

! 

GCGACGCCAA 
ACACCACGGA 
AAGGCTGCAA 
GGGGCGAGGA 
GGGAAATCAT 
GCGCCGCCTT 
CCTACCGCGT 
AGCAAACCAA 
CGGGTGCCCC 
GCACTGGCAA 
CAGCAATCGA 
TCTCCACCTT 
TTGAGGCCCC 
CGGGCTATGG 
TGTGGGCGGG 
CCTCCTCCTC 
CTTCCGCGGG 
GCTTCTCTAA 
TCTGCTTTGA 
TGGAGTGTGC 
CAGCCACGCA 
CTCCACTGGG 
AGGGGCAAGG 
CTGGAGGGTG 
GCCAGACTGA 
GAAGCAGCTA 
TCTGCCCACC 
AAGGGAGAGG 
TGGTCCTGAC 
GTTTGGGAAA 
GCATAGATCT 
TCAGAAGCCA 
CTGCCCCTGT 
GATCTTCCCT 
TGGCTGTCAC 
CCCTGCCCCT 
AATATTTGGA 
CCAAAGAAAC 
AGGGGACTAA 
TTAGGGAGTA 
TGTCTGCTAT 
AGAGCTGCTC 
TTTATTGAAT 
AGTCTGTGTC 
GGGGCAAAGC 
TTACAGGCCT 
CTCCACCCTC 
AGATCCTTGT 
CAGAGTCCCA 
TTCCTCCTGT 
AGGGGTGGGG 
GCCTATGTGA 
GGTTGCCGGG 
CTCCCCCTCT 
AATTCAATCC 
TGAGAGACAA 
CAGAGGAGAG 
TATTCTATTT 
ATGAAAAAAA 
GATAAAGTGA 



51 



I 

GCTCTGCGCT 
GTGTGTTCCC 
GATTAAGGCC 
ACCAGTGTTC 
CTCAGCAGCG 
TGGTGTGGCT 
GGTGGGGCTG 
CACATACATT 
AGGCAACGTG 
GATCCTCGAG 
TAGCCGCTAC 
CCGGCAGAAC 
ACGCCTGGGT 
CGTGGGCAAG 
CCAGGAGAAC 
TTCCGCCAAG 
ACCCGAGCTG 
ACTTGGTGGG 
GAGCGAAGTG 
AGTACGCATC 
AGCCATCCGA 
CCCAGCCTGG 
TGCTTAGAGA 
CGCCACTTTC 
AAATTTTACT 
AGAGAAACTT 
TCCCCCGCTG 
GAGAATTACC 
CTCCGACCTC 
CTGTCACCAG 
TCTAGGGGTT 
TCTGTCCCCA 
CCCCTGCAGT 
TTCCTCAACT 
TCCCAGCTCT 
ACCAGCCCAG 
TGGTCCCTCC 
ATCTCAAATT 
TACTGGGGTA 
GTTCTTGGGC 
TTAAAGGAGC 
CTCCCATGCA 
CCTTATTAGC 
CCACCTGATG 
AGGACTGGGG 
TGCACCCCCT 
CTGGTTCCGC 
AAATAGTGGG 
TTCTTCCTGG 
CGAGAATTAA 
AAGGAGGTCA 
GCCATCCTGA 
GGGCAGGTGG 
CCACTCTTTA 
CACCCTTACC 
AATAGAGGAG 
AAAACAAAAA 
TAATAAAGGC 
AAGACTTTTT 
TCCTTAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2BB0 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 



232 



WO 03/003906 



PCT/US02/21338 



AATAAAAAAA CTTTAGAAGG TTTAQAATAT ATGTAGGGAG AGAAGAAGAA AAAAATACAT 3660 

TTGTATTCAG AGTTAAATCT TAAAAAAAAA AAQTGTTTTT AATATATGTT TGGGTTTACG 3720 

TTGCTTTTTT CCCCCACTTT TTTTTTGGGG AGGAATGTCA TTTGCTTTTC TTGGGGGAGC 3780 

ATCCCGGGGG TGAATGGTGG AGAGAGGAGC TGGGGGAACC CGGTCCCTCC TGGGACCCTT 3840 

5 CCAGTAGATT GGATTTCACT CCATGGACTC CTCCTCCCCT CTCCCCCTCC CCCTCAGGGG 3900 

AGCCGGCAGA GCCAAACAAA GAAAGGGATT AACAAGAAAG GAAGAAGCTG TAGGACTAAG 3960 

GACTGAGGAT CCTGGGGTGT CCCCCACCAC TTTCCCCTGC CCTGTCGCAG GGGCAAGTGA 4020 

GGAGGGGGAA TCCAGAATTA AGGCCTAGCA GGCCTATAGG AACCCTCAGA GATGTGTGAG 40 BO 

ATTTAAGAGA TCTAGATTTT TTTTTAACCA AAAACAAGAG AGAAAGAGAA GAAAAAGAGA 4140 

10 AACCGAGGGG TTTAAAAGAA AAGAATACTA CAAAATAATA ATTATTAATA ATAATAATTC 4200 

AAATTTATTT CATATAATCC TAGAGAGAGA AAGAAACAAT TACTAGTTAC TTAGTAGACA 4260 

ATATTAAGAT AGCTTAAAGT TTAGTAGCAT TGAGGGCCCC TGGGTCCAGT AGAATGTATA 4320 

AAAGTTGTAA GGAAAAGATA AATAGAGGAG GGAAGTGGCT GAGTCCACCC TGAGTTGCCC 4380 

AATCTTCAGA TACCAGGGTT GGATCAGGTT GCTAGTTTAA GATTGGGAGC TTCCAGTCTG 4440 

15 CTGGGGTTGA TTCTGAGAAT CCTTGGATTT TTAAATTGTA GGACAAAGAA ATGAGGGGTT 4500 

CATTTCCCAG GGTCTTGGAA AGGATGCACA CTGATCATCT CAATAAGACA GGGGCTGGGT 4560 

TGGGGGCAGC AGAGGAGGCC AAGCACATTC ACCTGCACCC CTAGTACCTG GGCAGCCCAT 4620 

ACTCCAATGT GGTATGTCCC CTCCTGGGGC TCCCAGCTCA AACCCTCCCA TGCCTGCTTC 4680 

CCCCAGGCCT AACTGAGGAA GTCCTTCTTG AAGTGTGACC TCGGTCCACT TCTCTACAGA 4740 

20 TTGATTTAAG AGCCTGGGAA GTCATTCCAC AAACAGACAC ACATGCACAC ACGCTTCTCA 4800 

CCTTCAGAGC TTCAAGAGCA CTGAGGCGAT CAGTCCCCTA CCCCTGTTCC CATCCAGCTT 4860 

TCCACTTAGC TTTGACCTCC ATGGCAGCAG TAGCAGTAAC AATCTCAGTA ATTGTTCTTT 4 920 

AAAGCTGACT CGTTCTTCAC CTACTTGCAA AGTGCTTTCT TGTCTCATAA AAGTTAGATT 4 980 

CCAAGAAGGA CTTCCCACGO AGTGGAGTGG AAACACTGTC CTTGAAGGCC TGGGAGAAAG 5040 

25 GCATCCCCAT GGG CACAGAG GCTGGGGAAA GGCACAGGGA CTTTGGGTGA CCCTAACCCT 5100 

GACCCTCTGC TCCAGTTCAC CTCCATCTAT ATGTGTTCAG GTAGGGGTCA TCTACTGTAC 5160 

CCTGGCCTGG GAACACATTG CCCTCCCCAC ACAAAACTGG AGGGCTTGGC TTCTGCGTGT 5220 

GAGAAATCAA CATTTTTAAA GCACTTGCCT TCTACCAACC CCAGCTTGCA ATCACTGGGC 5280 

CTTCCCCTCC TATCCAAGGG GTTGGAGGGG CCCCTTGGCT CTCCTTTTGG CAGGAGGAGC 5340 

30 CTGCTTCATT ACACCAATGA CTCTGCCATC CCCCTCCCTG GCCCTAGACC CCAAACACAT 5400 

CTCCCTCTAC CCAATTTACT CTTCTCGCCC CACCTAGGGA CAGATTCCCC CTGCTCTTTT 5460 

TGTCCTAGAA ACCCCGCTAG TTTGGGATGG TAGCGTCTGG GGTGGGGAGG GCTTCCCCTT 5520 

CCCCACTCGA GGGTGCGGGT GGGGAAGGGG GGGTGGGTGG AGACAGCCCT GGGGCAGGGA 5580 

GGATGGTCTC TCCACTGTAG AAAGTAGAGT AGGATTGTGG TCAGACTTAA TTTGAGGCAT 5640 

35 CTAGTGAAGA CACGTACAAA TCCACCAAGG AAAAAGATTT CAAAAGCAAA ATAAAAGCGG 5700 

GAAATAAAAC AGACCCAAGA ATAATCAAGT CAAAGTGATG TTGCACAAAA TGCAGAGAAA 5760 

CCAAGAAGGG GGAGGGTTAA TGTATTAAAT GTGCTATTAA GAACTTAATT TTATTAAAAG 5820 
TACTATTACT TAAGGCTC 

40 Seq ID NO: 16 Protein Sequence 
Protein Accession #: XP_044166 



60 

120 

180 

240 

300 

343 



45 1 11 ?1 31 4i 51 

III*' 
MVTGRREDVA TARREIISAA EHFSMIRASR NKSGAAFGVA PALPGQVTIR VRVPYRWGL 
WGPKGATIK RIQQQTNTYI ITPSRDRDPV FEITGAPGNV ERAREEIETH IAVRTGKILE 
YNNENDFLAG SPDAAIDSRY SDAWRVHQPG CKPI#STPRQN SLGCIGECGV DSGFEAPRLG 

50 EOGGDFGYGG YLFPGYGVGK QDVYYGVAET SPPLWAGQEN ATPTSVLFSS ASSSSSSSAK 
* ARAGPPGAHR SPATSAGPEL AGLPRRPPGE PLQGFSKLGG GGLRSPGGGR DCMVCFESEV 
TAALVPCGHN LFCMECAVRI CERTDPECPV CHITATQAIR IFS 

Seq ID NO» 17 DNA sequence 
55 Nucleic Acid Accession #: NM_005940 
Coding sequence: 23-1489 

1 11 21 31 41 51 

60 AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 
CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TCGCCCGGGC 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

65 CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 

70 CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTCGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 

75 CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 
TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTC6AGGATG CCCAGGGCCA 
CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 

233 



60 
120 



1080 
1140 



WO 03/003906 



PCTAJS02/21338 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



CTCACCGAGC 
AAGAACAAGA 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 
TTCTTTGGCT 
GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 



TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
OAAGCAAGGG 
CAGGGCCACT 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCAOGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



GTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGTCGTG 
TTAAGAGGAA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



Seq ID NO; 18 Protein sequence: 
Protein Accession ft: NPJJ05931.1 



MAPAAWLRSA 
PAPATQEAFR 
LVQEQVRQTM 
FFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFF 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11 
I 

AARALLPPML 
PASSLRPPRC 
AEALKVWSDV 
VHFDYDETWT 
RGVQHLYGQP 
FKAGFVWRLR 
VIjGPAPLTEL 
GVPSEIDAAF 



21 
I 

LLLLQPPPLL 
GVPDPSDGLS 
TPLTFTEVHE 
IGDDQGTDLL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRFPVHAA 
QDADGYAYFL 



31 
I 

ARALPPDVHH 
ARNRQKRFVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHWQGLP 
LVWGPEKNKI 
RGRLYWKFDP 



41 

1 

LHAERRGPQP 
SGGRWEKTDL 
RYWHGDDLPF 
LGLQHTTAAK 
EIAPLEPDAP 
SPVDAAFEDA 
YFFRGRDYWR 
VKVKALEGFP 



CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 
CCCGTCTOGT 
CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 
TTCTTTTTTT 



51 
I 

WHAALPSSPA 
TYRILRFPWQ 
DGPGGILAHA 
ALMSAFYTFR 
PDACEASFDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession ft: NM_005794. 

Coding sequence: 434-1276 



11 
I 

CACGCTGTGA 
GGGTCCGTGC 
GGGGCCAGCA 
AAGAACCGTA 
GAACCCATCA 
TGGTGAGACT 
TGAGACTATC 
ACTATGCTGT 
TCTGTGAGGA 
GTGGTCACGG 
GGGGCCCACG 
CTGCAGGGGG 
CGGGAGCAGC 
AGCGCAGGGG 
AAGATCCTAA 
ATGGAGAACA 
GTGGCGCTGG 
GCATTGGAGC 
ACTGACTTCA 
CATCAGCTGC 
TCTCCAGATG 
CTCTGAGAGG 
GTCTAGGTGA 
GGGCTTACTC 



GGTTCCCTTC 
CTCACTCGTT 
CTTCACTCCT 
CACCATCTTT 
GTGAGGCCAA 
TATCACCAAG 
CACCAAGCGG 
GACACCAACC 
TGCTAGGCTT 
CCGGGTAGCC 
GGCCCGGGAC 
CATGGCCAAG 
GGCTGAGGAC 
CCTGGTGTGC 
GATCTGGGAC 
GCTGCCCTAC 
TAATCCAGTA 
TAGAACACTG 
AATTATAAAA 
CAAGGAACAT 
CTTCCTGTGC 
CTCCACTCGG 
CTGAGGGGGT 
AGCAATTTGG 
CC 



Seq ID NO: 20 Protein sequences 
Protein Accession #: NP_005785.1 



21 
I 

AGCTTTGTTC 
CACCTTTAAG 
AGACCACGAA 
ATACTCACCG 
ATTCCGTACA 
ATTGCCAAGC 
ACCTATCGCC 
CAGCAGTTGC 
TGAGCAGCAC 
GGTCCACCAG 
TGGTCATCAG 
AGGGGCTGAG 
TGGTGGCCAA 
TCAACCCTCT 
GTGTGAACGT 
GGAGGGGTGC 
GTGTCTACAA 
TGGCCCCCAA 
GCAAAGTGTT 
AGAGGATTGG 
CCAGCTACGT 
AGTGGGGGCG 
TCATTTGGAT 
ATGCTAGGCT 



31 
I 

TTTTGGTCTT 

AGCTGTAACA 

TGCACCGAGA 

CAAGGGTCTG 

CATTTTGGTG 

AGTGAGACTA 

AAGTGGCCTG 

CCGGGGCTAC 

CGGGATAGAC 

TGGGATCGGC 

CAGCCGGAAG 

TGTGGCGGGC 

GGCCCTGGAG 

GGTAGGGAGC 

GAAGTCCCCA 

TGTCATCCTG 

TGTCAGCAAG 

GGACATCCGG 

TCATGGGAAT 

GGAGTCAGAG 

CAACGGGGAG 

GCTGCGTAGC 

CTGGAGCAGA 

TGAGGAAGAA 



41 
I 

CATGATAAAT 
CTCACCGCGA 
GGAATGAACA 
CAACTTCATT 
ACTTTGAAGA 
TTGCCAAGTG 
ATTCAGCAGG 
CAGGGCTGGT 
AGGAAGGGCG 
TTTGCCATCG 
CAGCAGAACG 
ATTGTGTGCC 
CACTGTGGGG 
ACTCTGGGGA 
GCCCTGCTGC 
GTCTCTTCCA 
ACAGCGCTGC 
GTAAACTGCG 
GAGTCTCTCT 
GACTGTGCAG 
AACATTGCGG 
TGTGGTCCCA 
GTCTGCCATT 
GAAAAACGCT 



60 
120 
180 
240 
300 
360 
420 
480 



51 
I 

CTTGCTGCTG 
AGGTCTGCAA 
ACTCTGGACA 
CTTGAAGTCA 
QACTGTCACC 
GTGAGACCAT 
AAGCATCTCA 
TTCATCCCTG 
TCCTGGCTAA 
CCCGACGTCT 
TGGACCGGGC 
ACGTGGGGAA 
GCGTCGACTT 
CCAGTGAGCA 
TGAGCCAGTT 
TTGCAGCTTA 
TGGGTCTCAC 
TGGTTCCAGG 
GGAAGAACTT 
GAATCGTGTC 
TGGCAGGCTA 
GCCCAGGAGC 
CTGCCAGACT 
TCGGCATTCT 



SI 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1 11 21 31 41 ^ 

LcAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 
AHWISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCWPGIIKT DFSKVFHGNE SLWKNFKEHH 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAQYSTRL 

234 



60 
120 
160 
240 
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Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession ft: EOS sequence 

Coding sequence: 77-904 

1 ii 21 31 41 51 

I I I I 1 1 

GCCGOGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGOGCGAGC AGCTGCTCCA 
ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG CTGGTAGCTC 
GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG AGTTCTACAA 
CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT TGTTTCGTCA 
GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG TTGTAGTGGG 
AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC TTGATGAACT 
TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA GGTATCTACC 
AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC TGTCTGCGGT 
TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA TGACCCTGGG 
AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA TGCGTGTGTT 
TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT GGATCAGTGA 
CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT GCATGTGGCA 
CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT TTGATGCTGC 
CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA AATGGGCCTT 
CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG TCAAGATCAC 
GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT GGGCTTCCTT 
TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT TTTCAAAAAT 
CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAG GGAT T TAAACTTTGT 
CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC AGTATGTTAA 
TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA TCTTTTCTTT 
AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT AATGTGTCTG 
TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC CAAAGATGGA 
AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA GATGCAGTGA 
CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA TCCAGCCCTG 
TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT TCCCAGGCGC 
CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC CCTCTGACTC 
TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA AAACCAAACA 
GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC ATTTCTCAAC 
TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG GAGAGATGGA 
GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA CCTACACTCC 
ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTT TAAAT A CTGAGCACAA TTTTTAAATA 
CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT CTACAGAGAA 
GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG ACCTCTGTAA 
GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT TGGAGCAAAC 
ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC TATTCTGGAA 
ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA TCATGTTCTT 
CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC TCACCTTTCT 
CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG CAGCAGCGCA 
AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT GAATCTTCTC 
TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG GGAATTATCA 
ACCTCAGGGA TACT CAT TTT AACTCAGGCG TGTCCTGCTT TGTAACATTC CATTGTTGGG 
AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC CTACATATGT 
AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA GTTAG^TTT 
GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA AAAATGAATG 
ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA GAACGCTGTT 
TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT TATTAAATTT 
ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA AATQCAAATT 
AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA TATGTTGGTT 
TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG GTCTTCCTCT 
ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT G^TCTCCCAGQ 
CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG CCTGCCTCCA 
CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT GCC^GGCTG 
GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT GCTGGGAATA 
GGCATGAGCC GTCAOGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG AATTAAATAT 
TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA GAACTGCTGT 
TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT CATTCTTATT 
TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA TAAATGCTCC 
AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT ™TTTTT TTT^TITTTT 
TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA TGAGCTGAGA 
AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT ™AAC*» 
AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT ACCCATCTTA 
TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC TC^TAAAGA 
AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT TCTATTGCCC 
AGGTATAATC CTTGTTAATA TTTTGGTTTG gtctcctctt tttttccccc aatatagttg 
TAAATAAATG ATGTCTTTCA GAGTTGACAT TTATCCTGTA GCTTGAATGG CATGTAAATG 
Cct^lT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA GCTATATGCT 
TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA ^AAGCCTA 
ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG OTCTGCACT 
TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT TTTGATATGT 
TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA GAATCTCCCT 
CC 



60 
120 
1B0 
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300 
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420 
480 
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660 
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235 
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Seq ID NO: 22 Protein sequence t 
Protein Accession 8: EOS sequence 

1 11 21 31 41 SI 

| | | | I I 

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LPFILPPICM CLPRQYATCP 60 
NSGIYLIWTL LVWGIGSVY PHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 120 
DRGRFKVWS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVPKLGLP 1B0 
SGLWWTLALF CWISDRAFCE LLSSFNFPYb HCMWHILICL AAYLGCVCPA YFDAASEIPE 240 
QGPVIKFWPN EKWAPIGVPY VSLLCANKKS SVKIT 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession &: CAT cluster 

1 11 21 31 |1 51 

TGAAGTATCT CTGTTTAATG TCATATATCT GAGATGTTCT GTGGAGCCAG AATTCAAAAC 60 

CCAGATACGC AGGAGGACAC ATTCTTGATC GGTATTTTAT TCTATTATTT TTATTAGTTC 120 

CAATCCTTAT GACTCTGTTT AAAATGAGTA TCTGCATTAT TGTGCAAACA CTTCTGAGAT 180 

CTCATCACAT ACCTGATCCC ATTTAACTTC AGTCTATCCC TCCATTACTT TTTGTGGAAA 240 

GTGCTATCTA AGTTGGAGGT TCTGCTTGAG TATTCTTAAT TTTTCTTCTT TTTAATGCAT 300 

ACGGTGGTCA AAGTAAATAG CTAACATATT GGCATCAAAC CATTGCCTAT TGAAGCAAGC 360 

TTTGAATGAA ATTGGTTTGT CATTGACCAC TGTAAGAAAC TCATTTGTAA AGATAATACT 420 

GAAAAACATC CTGACTCTTA TGTATGCTTG CCTGTCTCTG AACTCCACCT CTATCAGCAT 480 

ACTCAATACA TTGCCTCTTT CCAACATTTA TCTCCTATTT CAGGAGAAAG TTTTGT GGGA 540 

OCGTCTACCA CGTTGCAAAG ATTCAACATT TCTCTTAAAA ATAGGAGTTT CTTTTAGATA 600 

TAAGCAATGA AATTTCAAGC AACAATATTA CTTCTCTCAT GTCCTTTGCT TTTTTAAGAT 660 

CTGAACCTCA AGATTCAAAT GGATTTTCAT TGAAATTCAC CAOGATCCCC TCTGCAGCTG 720 

TGGTCTGAAA GTACCATTTC TGCATTTATT TTAGCCCATG GAATAACTGT GCTGAGAAAC 780 

CACAGAGTCA ATCAGATTCA AAATGTTAAA ATCCTTCCTG CTTGGAGTTT TCCGTCTTCA 840 

CATCAAAGCA TTTCATGCCC GTCAGCAACT TTTTAATGCA TTTGCTCATG GTTTGCAGAA 900 
TTTCCATTTA AAGACTTTCC TTGGCTGACT TCTCTGATGA GGTTTCCTGC TTGCCTC 



Seq ID NO: 24 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



I 



GTGTGTAGTC 
AGTAGGCTCA 
GAAAGTGGTT 
ACTACTTGGC 
ACCCTCTGTG 
CTGTTCCTGG 
AAAGGAACAT 
CCAACAGTCC 
CCTGGTGCAA 
GTCCTCCGGA 
GCAAGCAGGA 
AAACGAGGAG 
CGGAAAACTC 
CAGCACAGTT 
CAGAGGGGAT 
AAAAAGCAAA 
AAAGAAACTC 
AAAACTTTCT 
ATAGAGGTGG 
ATCTTTACAA 
TTCAATAGGC 
TAAAAAGAAG 
CAAAAAGTAA 
GAAGTGTTTG 
ATAAAAATAA 
AATTCTGGCT 



11 
I 

GGGTATGAGA 
GTAGCACGAA 
TAGCATGCTG 
CTCTGGGACT 
AGGAGGATGC 
TAAGCTCCTG 
CTACACACAG 
CCATTCTACT 
AGAACAGTCG 
AAAGGGACGT 
AACCTCATCA 
CAAATGCATT 
CAAGCAGGAA 
ATTCCATGGG 
CGTGGTGAAT 
GGACATGAGA 
CTATTTTTAA 
CCTGAAATAG 
AGTTCAGAGA 
ATGAGTTTCT 
AATGGTTTGA 
AAAAATTAAG 
TGGAGGGATA 
CACAATAATG 
TAGAATAAAA 
CCACAGAACT 



21 
I 

ACGATTGCAA 
CTGCAAATAC 
GAGCTGGTTT 
CAGATTCCTC 
TGTGAGGACA 
GAAAGAAGTC 
AGGAAATGAA 
TGCAGAAAGG 
TTTCTCCAAA 
GAAGCCCCCT 
GAGAAGTCAG 
AAAAAGTTGC 
GGATTTTAAC 
CTAAAATAAA 
TTCAATGAAA 
GAAGTAATAT 
GAGAAATGTT 
GAGATAAATG 
CAGGCAAGCA 
TACAGTGGTC 
TGCCAATATG 
AATACTCAAG 
GACTGAAGTT 
CAGATACTCA 
TACCGATCAA 
TGTCAGATAT 



31 

1 

GCAAAAGCAG 
TTGAGAAGGG 
TCTGCTTGGT 
CTCTTTAAAA 
AAATGAGATC 
TATGGATTAT 
TAACATGAAG 
TTGCTTACAC 
ACTGAAGCTG 
TTGTAATTTC 
CCAAGGAAAG 
TGACGGGCAT 
ATTTTGAATC 
TGCAGAAATG 
ATCCATTTGA 
TGTTGCTTGA 
GAATCTTTGC 
TTGGAAAGAG 
TACATAAGAG 
AATGACAAAC 
TTAGCTATTT 
CAGAACCTCC 
AAATGGGATC 
TTTTAAACAG 
GAATGTGTCC 
ATGACATTAA 



41 
I 

CTGAGGGACG 
AAAAGTTCAG 
AGCCCTACAA 
TGGTGCTAAT 
ATCCACATAA 
ACTCAACCTA 
TGAAGTCTTC 
TGAAAATCAG 
GAAATTATCT 
TGCATTAGCG 
TCTTTAAATG 
GAAATGCTTT 
TGATTGACTC 
GTACTTTCAG 
ATCTTGAGGT 
AATTTCATTG 
AACGTGGTAG 
GCAATGTATT 
TCAGGATGTT 
CAATTTCATT 
ACTTTGACCA 
AACTTAGATA 
AGGTATGTGA 
AGTCATAAGG 
TCCTGCGTAT 
AC 



51 
I 

CTGTCCAAGG 
CCTTGTGCTG 
CTTTGGCCCA 
AATAGCACCC 
GCCGTGAACC 
CACTCCAGTT 
AT CTCCATTC 
TTTATTTTCC 
GAAATATCAG 
TGCTCTCCTG 
GAAATTGTGC 
GATGTGAAGA 
TGTGGTTTCT 
ACCACAGCTG 
TCAGATCTTA 
CTTATATCTA 
ACGCTCCCAC 
GAGTATGCTG 
TTTCAGTATT 
CAAAGCTTGC 
CCGTATGCAT 
GCACTTTCCA 
TGAGATCTCA 
ATTGGAACTA 
CTGGGTTTTG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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1260 

1320 
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1440 

1S00 



Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

i ? i 1 r I 1 t 

GTGTGTAGTC GGGTATGAGA ACGATTGCAA GCAAAAGCAG CTGAGGGACG CTGTCCAAGG 
AGTAGGCTCA GTAGCACGAA CTGCAAATAC TTGAGAAGGG AAAAGTTCAG CCTTGTGCTG 
GAAAGTGGTT TAGCATGCTG GAGCTGGTTT TCTGCTTGGT AGCCCTACAA CTTTGGCCCA 
ACTACTTGGC CTCTGGGACT CAGATTCCTC CTCTTTAAAA TGGTGCTAAT AATAGCACCC 
ACCCTCTGTG AGGAGGATGC TGTGAGGACA AAATGAGATC ATCCACATAA GCCGTGAACC 
CTGTTCCTGG TAAGGAAATG AATAACATGA AGTGAAGTCT TCATCT CCAT TCCCAACAGT 
CCCCATTCTA CTTGCAGAAA GGTTGCTTAC ACTGAAAATC AGTTTATTTT CCCCTGGTGC 
AAAGAACAGT CGTTTCTCCA AAACTGAAGC TGGAAATTAT CTGAAATATC AGGTCCTCCG 

236 
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GAAAAGGGAC 
QAAACCTCAT 
AGCAAATGCA 
TCCAAGCAGG 
TTATTCCATG 
ATCGTGGTGA 
AAGGACATGA 
TCCTATTTTT 
CTCCTGAAAT 
GGAGTTCAGA 
AAATGAGTTT 
GCAATGGTTT 
AGAAAAATTA 
AATGGAGGGA 
TGCACAATAA 
AATAGAATAA 
CTCCACAGAA 



GTGAAGCCCC 
CAGAGAAGTC 
TTAAAAAGTT 
AAGGATTTTA 
GGCTAAAATA 
ATTTCAATGA 
GAGAAGTAAT 
AAGAGAAATG 
AGGAGATAAA 
GACAGGCAAG 
CTTACAGTGG 
GATGCCAATA 
AGAATACTCA 
TAGACTGAAG 
TGCAGATACT 
AATACCGATC 
CTTGTCAGAT 



CTTTGTAATT 
AGCCAAGGAA 
GCTQACGGGC 
ACATTTTGAA 
AATGCAGAAA 
AAATCCATTT 
ATTGTTGCTT 
TTGAATCTTT 
TGTTGGAAAG 
CATACATAAG 
TCAATGACAA 
TGTTAGCTAT 
AGCAGAACCT 
TTAAATGGGA 
CATTTTAAAC 
AAGAATGTGT 
ATATGACATT 



TCTGCATTAG 
AGTCTTTAAA 
ATGAAATGCT 
TCTGATTGAC 
TGGTACTTTC 
GAATCTTGAG 
GAAATTTCAT 
GCAACGTGGT 
AGGCAATGTA 
AGTCAGGATG 
ACCAATTTCA 
TTACTTTGAC 
CCAACTTAGA 
TCAGGTATGT 
AGAGTCATAA 
CCTCCTGCGT 
AAAC 



CGTGCTCTCC 
TGGAAATTGT 
TTGATGTGAA 
TCTGTGGTTT 
AGACCACAGC 
GTTCAGATCT 
TGCTTATATC 
AGACGCTCCC 
TTGAGTATGC 
TTTTTCAGTA 
TTCAAAGCTT 
CACCGTATGC 
TAGCACTTTC 
GATGAGATCT 
GGATTGGAAC 
ATCTGGGTTT 



TGGCAAGCAG 
GCAAACGAGQ 
GACGGAAAAC 
CTCAGCACAG 
TGCAGAGGGG 
TAAAAAAGCA 
TAAAAGAAAC 
ACAAAACTTT 
TGATAGAGGT 
TTATCTTTAC 
GCTTCAATAG 
ATTAAAAAGA 
CACAAAAAGT 
CAGAAGTGTT 
TAATAAAAAT 
TGAATTCTGG 
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Seq ID NOt 26 DNA sequence 

Nucleic Acid Accession #: NM_013282.2 

Coding sequence: 85.. 2466 



11 



CGACTCCTTA GAGCATGGCA 
GTCCCTCCCC TCAGCGCCGA 
ACCCACACGG TGGACTCGCT 
CAGGAGCTGT TCCACGTGGA 
GAGGACGGCC ATACCCTCTT 
GTCCGCCAGA GCCTCGTGCT 
ACCGACTCCG GCTGCTGCCT 
GCGGCCGCCG AGACTGACAG 
GGGCTGTACA AGGTCAATGA 
GAGGCGCAGG TGGTCAGGGT 
ACGTCCAGGC CGGCGCTGGA 
GAGAACGGCG TGGTCCAGAT 
AAGTGGCAGG ACCTGGAGGT 
AAGGAGCGGG GCTTCTGGTA 
CGGGAACTCT ACGCCAACGT 
TTCGTGGACG AAGTCTTCAA 
CCCATGAGAC GGAAGAGCGG 
TGCCGGGTCT GCGCCTGCCA 
TGCGATGAGT GCGACATGGC 
CCCAGCGAGG ACGAGTGGTA 
GCGGGAGAGC GGCTGAGAGA 
TCACAGCGGG ACTGGGGCAA 
GTCCCGTCCA ACCACTACGG 
CGAGTCCAGG TCAGCGAGTC 
AGCAACGACG GAGCGTACTC 
' GGGAATTTTT TCACATACAC 
GCGGAACAGT CTTGTGATCA 
TTTGCTCCCA TCAATGACCA 
GTCAGGGTGG TGCGCAATGT 
AACCGCTACG ATGGCATCTA 
TTTCTCGTGT GGCGCTACCT 
GAGGGGAAGG ACCGGATCAA 
GAAGCCCTGG CCAACCGAGA 
CAGGAGGGGG GCTTCGCGTC 
GGAGGTGGCC CGAGCAGGGC 
CCCTACAGTC TCACGGCCCA 
CTGTGGAATG AGGTCCTGGC 
TTGTTCCTGA GTAAAGTGGA 
CGGCCCATCA CGACCGTGTG 
CGGGCACAGG TGTTCAGCTG 
CAGGTGAACC AGCCTCTGCA 
CGGTGATCTC CAAGCACTTC 
CATCGGCACT gattttgttc 
CCTAAAAAGG TTTGTCTTCC 
GAATTTATGT ATTCTGGCTA 
CATAAAAGCC TGCAATTTCT 
ACTACGTGGT GTGGAGGCTG 
CAACTCTTTA AGAAGGCGAC 
AGCAAGCATC TTCCTGACAG 
TGGCCCGTGG CAGCCCGTGG 
AAAGAGGAAA CATCTCGGGC 
TGCTIAGCGT CTGAGATCCG 
CACGCAGAAA TGGCCTCAAG 
TGTCCGACGA AGGCGGCCAC 



21 
i 

TGGCTCAGAG 
CACCATGTGG 
GTCCAGGCTG 
GCCAGGCCTG 
CGACTACGAG 
CCCCCACAGC 
GGGCCAGAGT 
CAGGCCAGCC 
GTACGTCGAT 
GACGCGGAAG 
GGAGGACGTC 
GAACTCCAGG 
GGGCCAGGTG 
CGACGCGGAG 
GGTGCTGGGG 
GATTGAGCGG 
GCCGTCCTGC 
CCTGTGCGGG 
CTTCCACATC 
CTGCCCTGAG 
GAGCAAGAAG 
GGGCATGGCC 
ACCCATCCCG 
GGGTGTCCAT 
CCTAGTCCTG 
GGGTAGTGGT 
GAAACTCACC 
AGAAGGGGCC 
CAAGGGTGGC 
CAAGGTTGTG 
TCTGCGGAGG 
GAAGCTGGGG 
GCGAGAGAAG 
CCCCAGGACG 
CGGGTCCCCG 
GCAGAGCAGC 
GTCACTCAAG 
GGAGACGTTC 
CCAGCACAAC 
CCCTGCCTGC 
GACCGTCCTC 
TCGACAGGCG 
TTAGTGGGCT 
TTTTTTTTTA 
AAAGTTGGAC 
CGACAAAACA 
TTGATGTTTC 
AGGATCAGTC 
CATTTTGTCA 
CATGGCGTGG 
CTAGTTCAAA 
CGTGAAAAGT 
GGGACTCTGC 
GGACGGACGC 



31 



GTGCTGGTAA 
ATCCAGGTTC 
ACCAAGGTGG 
CAGAGGCTGT 
GTCCGCCTGA 
ACCAAGGAGC 
GAGTCAGACA 
GATGAGGACA 
GCTCGGGACA 
GCCCCCTCCC 
ATTTACCACG 
GACGTCCGAG 
GTCATGCTCA 
ATCTCCAGGA 
GATGATTCTC 
CCGGGTGAAG 
AAGCACTGCA 
GGCCGGCAGG 
TACTGCCTGG 
TGCCGGAATG 
AAGGCGAAGA 
TGTGTGGGCC 
GGGATCCCCG 
CGGCCCCACG 
GCGGGGGGCT 
GGTCGAGATC 
AACACCAACA 
GAGGCCAAGG 
AAGAATAGCA 
AAATACTGGC 
GACGATGATG 
CTGACCATGC 
GAGAACAGCA 
GGCAAGGGCA 
CGCCGGACAT 
CTCATCAGAG 
GACCGGCCGG 
CAGTGTATCT 
GTGTGCAAGG 
CGCTACGACC 
AACCAGCTCT 
TTTTGCTGAA 
TAACTTAAAC 
TTTTTATTTT 
TTCTCAGTAT 
ACACAAGATT 
TGGTGTCAAG 
CTTCTCTAGG 
TCTAAAGTCC 
CTCAGCTGTC 
CCTTTGCCTC 
CCTCTGCCCA 
TCCACGTGGG 
CAGCACAOGA 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCCGACAA 
ACCCGCCCCT 
ATGCCAGCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATGA 
TTTCCGGCAA 
GGGCGCTGGC 
ACTGGCGGTC 
AGTACGCCCC 
CCGAGAAGGG 
AGCCTGGCCC 
AGTATCCAGA 
AGAGGGAGGA 
AGTGGAAGCG 
CCAAGAAAAC 
AGGACAAGAG 
CGAGCGGCAG 
GCTGTCAGGA 
ACTGCCTGGA 
TGGGCCGCAG 
TCCCCGGCTA 
AACGTGTCGG 
AGGTAGTGTT 
TCAAATCTAT 
TGTGTTTAGT 
TTTTAAAGAT 
TTCTCAGAAG 
GTTCTGGCCC 
AGTGACATGG 
TGTTGAAGTT 
AAAGCCATCC 
CGAGAGCAGG 
GCCAGGCGTG 
AGTCACGTGC 



51 



GGTTTTTGCT 
CGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 
GGTGGTACTG 
CACATCGTCC 
ATGTACCATC 
GTGGCGGTTC 
ACAOGGCCGG 
CGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAAGCCG 
CGCTGAGGGC 
GAAGTCCGGG 
TTGGACGAAG 
AGGCTACCTG 
GGAGGAGCAG 
GAAGTCGGCA 
CAAGGTGGAG 
CAACGCCAAG 
CCCGTTCCAG 
GCTGGTGTTC 
CAGATCCTTT 
CTATGCCATG 
CGGCAATGGC 
AGGGCTCGTT 
TCCTCCGTTC 
ACATTTTCAG 
TCTTTGAAAA 
GGAATCAGAA 
TTGCTGCCAC 
CCAAGGTCAG 
TTCCCCGTGG 
QTTGCAAGGA 
CCCACCAGAC 
GAGTTGGGGC 
TGACTGACGC 
AAGTGCCTTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2B20 

2880 

2940 

3000 

3060 

3120 

3180 

3240 



237 



WO 03/003906 



GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTOT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 

TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 3 828 

Seq ID NO: 27 Protein Sequence 
Protein Accession ft: NP_037414.2 

1 11 21 31 41 51 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELFHVEP GLQRLFYRGK QMEDGKTLPD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTKMGA WPEAQWRVT RKAPSRDEPC SSTSRPALEE 180 

DVIYHVKYDD YPENGWQMN SRDVRARART IIKWQDLEVG QWMLNYNPD NPKERGFWYD 240 

AEISRKRETR TARELYANW LGDDSLNDCR IIFVDEVPKI ERPGEGSPMV DNPMRRKSGP 300 

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAP HIYCLDPPLS SVPSEDEWYC 360 

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMH RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VIAGGYEDDV DHGNFFTYTG 480 

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCPAPINDQE GAEAKDWRSG KPVRWRNVK 540 

GGKNSKYAPA EGNRYDGIYK WKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 600 

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 

TFQCI CCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 780 
VLNQLFFGYG NGR 



Seq ID NO: 28 DNA sequence 
Nucleic Acid Accession 8* NM_000756.1 
Coding sequence : 186 . .776 

1 IX 21 31 |1 5 1 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 
GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAOACGTC 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 
CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCKXOT 
GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 
AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 
CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 

^Sagag CCCGGCCGCT cccctttcgc ccgcctcctc gctcctcgcc ggaggcagcg 

GCAtc^CCG TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 540 
S TC^CGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600 
ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 

tctSSgg^ TCTCACCrrC cacctcctcc gggaagtctt ggaaatggcc agggccgagc 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 

ctgcgtttgg ccaaaaagaa tctgcattta gcacaaaaaa aatttaaaaa aatacagtat 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 
GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 

g^IgSacac aatgttattc gttttgtcta cgtttagttt ccgtttccag gtgtttatag 

?S^A £2flT SScTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 
AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 
SSSS TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAA^AA 
ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 
TAATAAAAGT TGCAAAGGT 

Seq ID NO: 29 Protein Sequence 
Protein Accession 8: NP_000747.1 

X 11 21 31 41 51 

LlPLLVSAG il^VALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 

ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 

LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 
AQQAHSNRKL MEIIGK 



60 

120 

180 

240 

300 

3 60 

420 

480 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 



Seq id NO: 30 DNA sequence 
Nucleic Acid Accession 8: CAT cluster 

11 21 31 41 51 

| | | I I I ^——p , n 

ATTTCTTGAG ATGGCTCCTC TTGGAACTAT GCATCTCTCC TGATACTTGG ATGCTTTTCC 60 
TCTGACTGAT GAAGATCCTG AATACCAAAG AGGGCCGCTG AAAGGGCTCT AGGAGTACAC 

238 



120 



WO 03/003906 



CTTCTAGGAA 
CTTTGAAGAG 
TGAOGTTGCT 
CATTTGGTTT 
AGAAATGGGT 
AACTCCCTAA 
AATGTGTTTG 
TGTGCTCAAT 



CCCTAAGCCA 
CAGTCAGACT 
GGGGGAACTC 
GGGTGTCAGT 
TCAAATTTTA 
AATTCAGTTT 
TGAAGATTAA 
AAGTGTTTAT 



GAGAGAGGCT 
CCTGCTTTGC 
TTTTACAACA 
AGCATGATGG 
CCTGCAACTC 
TTTCTTTGTA 
AAAAGTTAAC 
TTATTTATTG 



TCACTACATC 
TCTTCAGACT 
TCTCTCTGAA 
AAAGACAAAA 
ACCAATTCTG 
AAATGGGGTT 
ATAAAGAGTT 
CTGAATAAAC 



ATGCTTCCTG 
TAATTTGGGG 
AGAATCCGGG 
AAACACAACT 
TGGCCTTGGT 
ATGAACAGTA 
TAGAAGAGTG 
CAGTAATTTA 



ACATCTCTCC 
GTTTAACAGG 
CTGCCAGTTT 
TGACATCTGC 
TCAGCAATTA 
CCTACTTCAA 
TCTGGCATAT 
ATTAGTAT 



180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 31 DNA sequence 
. Nucleic Acid Accession ft : Eos sequence 
Coding sequence: 134.. 1917 



11 



21 



CACCTCAGCC 
TCAGGAGAGT 
AAACCTTATT 
GAGTGTCCCG 
GGACAGACAC 
GGTTGAGGCT 
AGACCCTCCC 
CCAAGCCGCG 
GGCCTCCTCC 
CGAAATCGCC 
GGAGGCGCCG 
GGAGGAGGCC 
GGGGCCCCGG 
GCGGCTGGAC 
CGGCCTGGCG 
CCCGTGCCTC 
CGGCCGGGGC 
CTCAGGGCTC 
GCCTCTACCT 
CCAGGTGCCC 
GTTTCTGGCG 
CTGCTACAAC 
CCAGCTCAAG 
CAGCATGGAG 
AGGCACCTTC 
GGGTCACCTC 
CCCATACAGT 
CCGCTTCGAC 
CTCCTGGAGC 
CCTGGGCAAC 
GGGGACTGCC 
CCTCAGTCCA 
GGCAGGCAGA 
TGAGAGGCCG 
AACTTTCCCC 
CTTTCTGCCC 
GACCCTCTCA 
GACTGATGCG 
GAGGGGGTAG 
TCCCCAGGCA 
CAACCCCGTA 
GCCTCTGAAA 
GCCCACACCA 
CCAGCCCCTC 
AGCGTCACCC 
CCTTTCTCCT 
AGACAAGATC 
ATGCGTTTTC 
GTGCTCCACT 
TCTATGTTCA 



CCAACCCCAA 
GTGGCTCTCC 
GCCATGGTTC 
AGGGCGGTTC 
GGCCCCTCTT 
GGAGGTCAGC 
CCAGGCCTAA 
ATGCCCAGGG 
TCTGCGAGGC 
CCGAGCTCGG 
GGGGAGGGGG 
CGGAAGCTCA 
AAGCCCAGCT 
CTGGGCAGTT 
CAGGAGACCT 
TACCGCCGGC 
CGGGCGGTGC 
CCCAGGGGCC 
GCGCACCTGC 
GAGGAGGCCC 
GGGGGCATCC 
CCTCTGACCA 
CTGGTGGCCC 
TGCTACGACC 
CCTGTGGCCC 
TTCTACCGCC 
GCCAGCCACC 
CTGCTGCGGG 
AGGGCTGCCT 
ACCATTTACT 
CAGTTCCAGG 
TTCATCCTGA 
GAACCAAAGC 
GGGCTCAGGG 
TTCTGCTTTA 
CTCACTCCAC 
GCTTGCTGAC 
TCTCCACAGA 
AAAACATTCA 
ACGCCCTCGC 
GCTATCTGGG 
CTGGTCCGCT 
GGCCAGGCCC 
CAGAAGCCCA 
TAGTTCTGCC 
TGTCTATAAA 
ACAGCTATGA 
TCCCCTTACG 
CCAACCTCCA 
GCACCACTGG 



CCCCAGCCGC 
CCAGGCGCTA 
TTAGAAGCCA 
CCGGGAGCCC 
CTTCAGTGGG 
CACAGCCAAG 
GAGGAGAGGG 
GCCCCGCACA 
GCTCACAGCC 
AGCAGGAGGT 
GCAGCCCTGC 
TGGTGTTTCT 
CCCGGGCCCT 
GCCTGGACGT 
ACGCGCTGAT 
TGAGCGCGGC 
TGGGCGTCCT 
CTCGTGGCGA 
ATGTGTTCAA 
CGCTTCGGGG 
GTGGCTCCGG 
ACATCTGGAG 
TGGACGGGCT 
CGCGAACAGA 
ACGAGGCTGT 
TGCTCAGGTA 
GGCGTTCCAG 
GCGTGGGCGC 
CCCTGCCCCT 
GCCTCAACCC 
CCAAGGAGCT 
CTCTGCCCCC 
TGCTTCGCTG 
AAGGGGCTGG 
AAGGTTGTCG 
ACCCAGACTG 
ACCCCCCTGT 
CAAGGACTTG 
CACTTCCTAT 
CTCTGGGACT 
TCTGTTTGGC 
GGTGCAGCCC 
ACTCCGGGCT 
CCCCACTTCT 
CCTTTTTAGC 
ATGTGGACAG 
GCACCTCGCA 
CACTTTGAAA 
GCCTGGATGT 
CACCAAATAC 



31 
I 

ATCCCCTGCC 
CCAGGAGGGG 
CCCCTTCCCC 
CGTGGGTCCC 
GACAGTCATA 
AAGCTCCGAG 
AACCAGGGAG 
GCCCCCCGCG 
GGTACCCCAG 
CAGGCCGGCC 
CGGCCGCAGC 
GCAGAGGCCC 
GGAGCC CGCC 
GCTGGCCTTT 
GAGCGACAAC 
CGACCGCGAG 
CGTACTGCCC 
GGAGCCTCCT 
CCCCCGGGAG 
CTGCGGTCTC 
TGCCAAGGCC 
CCAGGTTCGG 
GCTCTATGCC 
CGCCTGGACC 
GGCCTGCCGT 
CAGCCCCGTG 
CGACATCGTG 
CGCCGTGATG 
GCCCGCCCCC 
CCAGGTCACT 
GCAGCCCTTC 
TGAGGACCGG 
CTCTCCAGGG 
GATCGGAACT 
ATTATTTTGA 
TTTCCTGACT 
CTGTGGGACT 
GCTCGCTGGA 
GCTCTGTCAG 
TTCTGCCTGT 
ACTGTGGATT 
TGCTGTCTGC 
CACCACCCTC 
CGCCAACCCC 
TGTGTAGACT 
TGGACGTCTG 
CGGTGTCCAG 
CCCATGCTAG 
CCCTGTCTGG 
ATTTTAATTC 



41 



CCAGCTGACG 
CAGGTCTCAG 
AGGCAAGACA 
AGCACTTCCA 
GGGACAGGTA 
ACCAACGGAT 
AAAAGTCTAG 
CAGAGGCCGC 
CTACGGAAAC 
GCCTCGGGGG 
GGGGCGCTCA 
GGGGGTTGGG 
ACGGCGGCAG 
GCCCAGCAGC 
CTGCTGCGAG 
CGCATCCTCA 
AGCCTCTACC 
GCGGCGGCCC 
AACACCTGGC 
TGCACCATGC 
GTCTGCTCCA 
CCCATGCAGC 
ATCGGTGGCG 
CCACGCGCGC 
GGGGACATCT 
AAGGATGCTT 
GCACTGGGGG 
CGCTACAACA 
GCCCCACTGC 
GCCACCTTCA 
CCCTTGGGGA 
CTGCAGACCT 
AGACCCTCCT 
TCCTGCTCTT 
AGCCCAGACT 
CAATTCCGTA 
CCCTATTCCC 
GCTCTGCTGA 
CAGGACAGGG 
CCTAAGGCCT 
CTCAAGGGCC 
AGCTCCTGCC 
TGCAGCCTTG 
CGATCTCTAA 
TGGACGAGAC 
TCACCCAAGA 
GATGCACAGC 
AAAAGTGAAT 
GCCCTTTTTC 
ACCGAAAGCA 



51 
I 

GGTCAAAGCC 
CCAGCTGGGG 
GGCCCCAAGG 
CACACTCTGA 
CAGGGGGCCT 
CGCCCAGCCC 
ACCCGCTGCC 
CTGGCCCCGC 
GCAGCAGGTG 
ACCCTCAAGG 
CGGAAAAGCA 
GGGTGGTGGA 
CCCTGCGGCG 
ACGGAGAGCC 
TGCTGGGAGA 
GCCTGCGGAC 
AGGGGGGCCG 
CTGTGTCCCT 
GGCCCCTGAC 
ACAACTACCT 
ACGAGGTCTT 
AGGCCCGAGC 
AATG CCTGTA 
CACTCCCCGC 
ACGTCACCGG 
GGGACGAGTG 
GCTT CCTGTA 
CAGTGACCGG 
ACTGCACCAC 
CGGTCTCTGG 
GCACCGGGGT 
CACTCTGAGT 
GGGATGGGCC 
GTTTCTGGAC 
CCCTCAGCCT 
CCTACTTACA 
TAGAGCCAGG 
GCCGAGAGAG 
AGCAAAAACG 
CCCCAGGTAC 
TAGAACCCTT 
CATACCCCCA 
TGGGGCTCTC 
ATGAGGCCTG 
ATTTGACTTC 
GAGTTGTGGG 
ACAATCCATG 
ACATCTGACT 
TGTTTTTTAT 



Seq ID NO: 32 Protein Sequence 
Protein Accession ft: XP_054631.1 



I 

MPRGPAQPPA 
GEGGSPAGRS 
LGSCLDVXAF 
RAVLGVLVLP 
EEAPLRGCGL 
LVALDGLLYA 
PYRLLRYSPV 



11 
I 

QRPPGPAASS 
GALTBKQEEA 
AQQHGEPGLA 
SLYQGGRSGL 
CTMHNYLFLA 
IGGECLYSME 
KDAWDECPYS 



21 

I 

SARRSQPVPQ 
RKLMVFLQRP 
QETYALMSDN 
PRGPRGEEPP 
GGIRGSGAKA 
CYDPRTDAWT 
ASHRRSSDIV 



60 
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2160 
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2400 
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2700 

2760 

2820 

2680 

2940 



31 41 51 

I I I 

LRKRSRCEIA PSSEQEVRPA ASGDPQGEAP 
GGWGWEGPR KPSSRALEPA TAAALRRRLD 
LLRVLGDPCL YRRLSAADRE RILSLRTGRG 
AAAPVSLPLP AHLHVPNPRB NTWRPLTQVP 
VCSNEVFCYN PLTNIWSQVR PMQQARAQLK 
PRAPLPAGTF PVAHEAVACR GDI YVTGGHL 
ALGGFLYRFD LLRGVGAAVM RYNTVTGSWS 

239 



60 

120 

180 

240 

300 

360 

420 



WO 03/003906 



RAASLPLPAP APLHCTTLGN TIYCLNPQVT ATFTVSGGTA QPQAKELQPF PLGSTGVLSP 
FILTLPPEDR LQTSL 



480 



Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



1 
I 

CCTCTGTCTA 
AATCCCCTTG 
AAACTCTGTG 
TTTOAAACGA 
GCAAATTGCA 
GAGTTGAATG 
AAGAAAACCC 
CAAACAGAGT 
ACACATCGCA 
TTTTCTGCCT 
GTTTCAAATC 
AAAGAAGTTA 
AGGCATCTAA 
GGCTGTATGA 



11 
I 

ACCTTACATG 
CAGACTTTAC 
AGTTGAACCA 
AGATATTTCC 
CAAATAGAGT 
CACACAACAC 
GTTTCCAACG 
GTTTCCAAAC 
GAGCAGTTTC 
TTGGCCTCAA 
TGCTCTGTCT 
CTGAGAATTC 
GAGGTCCAAA 
AAAGAAAGGT 



21 
! 

AAAAAACCCG 
AAACAGAGTG 
ACACATCACA 
TTTTCTGCCG 
GTTTCAAATC 
AAGGAAGTTA 
AAGGCCTCTA 
TGCTGAATGA 
TGAGAATGAT 
AGCGCTTGAA 
AAAGGAAGGT 
TTCTGTCTAG 
TATCCACTTG 
TAAACTCTGT 



31 
I 

TTTCCAACGA 
TTTCCTAACT 
ACGCAGTTTG 
TTGACCTTAA 
TGCTCTGTCT 
CTGGGAATTC 
AGTGGTCAAA 
AAAGAAAAGT 
TCTGTCTAGT 
ATCTCCACTT 
TCAACTCTGT 
CATTATATGA 
CAGACTTTAC 
GAGTTAAACA 



41 
I 

AGGCCTCTAA 
GCTCTATGAA 
TGGGAATGAT 
AGCGCTTGAA 
AAGGGAACGT 
TTCTGTCTAG 
ATATCCACGT 
AAACTCTGAG 
TTTTATACGA 
GCAAATTCCA 
GAGTTGAATA 
AGAAATCCCG 
AAACAGAGGG 
CACA 



SI 
I 

AAGGCCAAGA 
CAGAAAGGTT 
TCTGTCTAGT 
ATCTACACTT 
TCAACTCTGT 
CCTTACATGA 
GCAGACTTTA 
AGTTGAACGC 
AGATATTTCC 
CAAAAAGAGT 
CACACAACAC 
TTTCCAACGA 
TTTCCAGAAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



Seq ID NO; 34 DNA sequence 

Nucleic Acid Accession ft: AF01146B.1 

Coding sequence: 257.. 1468 



GGAAGACTTG 
ATCTCAGTGG 
CAAGTCCCCT 
GCGCCTTTGC 
TACAGCTAGA 
TACAGCTCCA 
TCCATTACCT 
GCGCGTTCCT 
GCAGAAGCAA 
CCAAAAGAGC 
ATCAAAACAG 
TGGTCGCCCT 
CAAGTTTATT 
GCATCAGCTC 
ACTGTATGGT 
TGGAACAGTT 
TTATATAACA 
AGACATTAAG 
TGGGTGGTCA 
CCTGCCCCCT 
TGGAGTTCTT 
AGAGACCTAC 
AGCCAGGGAC 
AGAAGTACTT 
CAAAGAATCA 
GGCTGCCATA 
CCTCAATCTA 
CCCTATTCAG 
AATTGTGCTA 
CAGCCTGTGC 
AGTGACCACT 
CTGAGTGAGT 
GTGATTCTTT 
GGCATTCCTT 
TAGTCCTCCT 
ACCTCCATTT 
GGCTTATTTT 
AATAAGTATC 



11 
1 

GGTCCTTGGG 
CGGACGAGGA 
GTCGGTTCCT 
ATCCGCTCCT 
GGCATCATGG 
GTTGGAGGTC 
GTAAATAGTG 
TTGCAAGCAC 
TTGCAGGCAA 
AAGCAGCCCC 
AAAAATGAAG 
CTGGGTAAAG 
CTGGCTCTTA 
AGAAGAGAAG 
TATTTCCATG 
TATAGAGAAC 
GAATTGGCAA 
CCAGAGAACT 
GTACATGCTC 
GAAATGATTG 
TGCTATGAAT 
AAAAGAATAT 
CTCATTTCAA 
GAACACCCCT 
GCTAGCAAAC 
TAACCTGACA 
GAACGCTACA 
AAAGCTCCAC 
CTTATACTGG 
TAGGCATGGT 
CTGCCCTGAC 
GTGTAACTTA 
TTAAGTATGA 
TAGGAATGCT 
TAACCACTTA 
AGGGATTTGC 
TTTAAAACAT 
TATGTCTAAA 



21 
I 

TCGCAGGTGG 
CGGCGGGGAC 
CCGTCCCTGA 
GGGCACCGAG 
ACCGATCTAA 
CAAAACGTGT 
GCCAGGCTCA 
AAAAGCTTGT 
CCAGTGTACC 
TGCCATCGGC 
AATCAAAAAA 
GAAAGTTTGG 
AAGTGTTATT 
TAGAAATACA 
ATGCTACCAG 
TTCAGAAACT 
ATGCCCTGTC 
TACTTCTTGG 
CATCTTCCAG 
AAGGTCGGAT 
TTTTAGTTGG 
CACGGGTTGA 
GACTGTTGAA 
GGATCACAGC 
AGTCTTAGGA 
GGAACATGCT 
CAAGAAATAT 
ATCAATAAAC 
TTCATAATCT 
GTCTTCACAG 
CCCGATCAGT 
TTGGGTTGGC 
AAATAAAGAT 
GTGTGTCTGT 
TCTCCCATAT 
TTGGGATACA 
TGGAGTCATA 
AAAAAAAAAA 



31 
I 

GAGCCGACGG 
AAGGGGCGGC 
GTGTCCTTGG 
GCGCCCTGTA 
AGAAAACTGC 
TCTCGTGACT 
GCGGGTCTTG 
CTCCAGTCAC 
TCATCCTGTC 
ACCTGAAAAT 
GAGGCAGTGG 
TAATGTTTAT 
TAAAGCTCAG 
GTCCCACCTT 
AGTCTACCTA 
TTCAAAGTTT 
TTACTGTCAT 
ATCAGCTGGA 
GAGGACCACT 
GCATGATGAG 
GAAGCCTCCT 
ATTCACATTC 
GCATAATCCC 
AAATTCATCA 
ATCGTGCAGG 
ACTGAAGTTT 
TTGTTTTACT 
ATGACACTCT 
GGAGGCAAGG 
GAGGCAAATC 
TAAGGAGCTG 
GAAGCCTGGT 
ATATGTACAG 
CCGGCACCCC 
GAGAGTGTGA 
GAAGAGGCCA 
GCATGTGTGT 



41 

I 

GTGGGTAGAC 
TGGTCGGAGT 
CGCTGCCTTG 
GGATACTGCT 
ATTTCAGGAC 
CAGCAAATTC 
TGTCCTTCAA 
AAGCCGGTTC 
TCCAGGCCAC 
AATCCTGAGG 
GCTTTGGAAG 
TTGGCAAGAG 
CTGGAGAAAG 
CGGCATCCTA 
ATTCTGGAAT 
GATGAGCAGA 
TCGAAGAGAG 
GAGCTTAAAA 
CTCTGTGGCA 
AAGGTGGATC 
TTTGAGGCAA 
CCTGACTTTG 
AGCCAGAGGC 
AAACCATCAA 
GGGAGAAATC 
ATTTTACCAT 
CAGCAGGTGT 
GAAGTGAAAG 
TTCGACTGCA 
CAGAGCCTGG 
TGCAATAACC 
AAAGCTGTTG 
ACTTGTATTT 
GGTAGGCCTG 
AAAATAGGAA 
TGTGTCTCAG 
AAACTTTAAA 



51 



I 

CGTGGGGGAT 
GGCGGAGCGT 
TGCCCGCCCA 
TGTTACTTAT 
CTGTTAAGGC 
CTTGTCAGAA 
ATTCTTCCCA 
AGAATCAGAA 
TGAATAACAC 
AGGAACTGGC 
ACTTTGAAAT 
AAAAGCAAAG 
CCGGAGTGGA 
ATATTCTTAG 
ATGCACCACT 
GAACTGCTAC 
TTATTCATAG 
TTGCAGATTT 
CCCTGGACTA 
TCTGGAGCCT 
ACACATACCA 
TAACAGAGGG 
CAATGCTCAG 
ATTGCCAAAA 
CTTGAGCCAG 
TGACTGCTGC 
GCCTTAACCT 
TAGCCACGAG 
GCCGCCCCGT 
CTGTGGGGAA 
TTCCTAGTAC 
GAATGAGTAT 
TTTCTCTGGT 
ATTGGGTTTC 
CACGTGCTCT 
AGCTGTTAAG 
TATGCAAATA 



Seq ID NO: 35 Protein Sequence 
Protein Accession ft: AAC63 902.1 



11 



21 
1 



MDRSKENCIS GPVKATAPVG GPKRVLVTQQ 
AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR 
EESKKRQWAL EDFEIGRPLG KGKFGNVYLA 
EVEIQSHLRH PNILRLYGYF HDATRVYLII» 
ANALSYCHSK RVIHRDIKPE NLLLGSAGEL 
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31 41 51 

I I 1 

IPCQNPLPVN SGQAQRVLCP SNSSQRVPLQ 
PLNNTQKSKQ PLPSAPENNP BEELASKQKN 
REKQSKFILA LKVLFKAQLE KAGVEHQLRR 
EYAPLGTVYR ELQKLSKPDE QRTATYITEL 
KIADFGWSVH APSSRRTTLC GTLDYLPPEM 

240 
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IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360 
SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNCQNKESAS KQS 

Seq ID NO: 36 dna sequence 

Nucleic Acid Accession #: NM_016267 

Coding sequence: 67.. 843 

1 11 21 31 41 51 

I I I I I I 

CGGCACGAGG ACAGAATCAG GTGATQGTCC AGAATTAAGA GCTGTCACCT GTGTCATTCA 60 

CTCACAATGG AAGAAATGAA GAAGACTGCC ATCCGGCTGC CCAAAGGCAA ACAGAAGCCT 120 

ATAAAGACGG AATGGAATTC CCGGTGTGTC CTTTTCACCT ACTTCCAAGG GGACATCAGC 180 

AGCGTAGTGG ATGAACACTT CTCCAGAGCT CTGAGCAATA TCAAGAGCCC CCAGGAATTG 240 

ACCCCCTCGA GTCAGAGTGA AGGTGTGATG CTGAAAAACG ATGATAGCAT GTCTCCAAAT 300 

CAGTGGCGTT ACTCGTCTCC ATGGACAAAG CCACAACCAG AAGTACCTGT CACAAACCGT 360 

GCCGCCAACT GCAACTTGCA TGTGCCTGGT CCCATGGCTG TGAATCAGTT CTCACCGTCC 420 

CTGGCTAGGA GGGCCTCTGT TCGGCCTGGG GAGCTGTGGC ATTTCTCCTC CCTGGCGGGC 480 

ACCAGCTCCT TAGAGCCTGG CTACTCTCAT CCCTTCCCCG CTCGGCACCT GGTTCCAGAG 540 

CCCCAGCCTG ATGGGAAACG TGAGCCTCTC CTAAGTCTCC TCCAGCAAGA CAGATGCCTA 600 

GCCCGTCCTC AGGAATCTGC CGCCAGGGAG AATGGCAACC CTGGCCAGAT AGCTGGAAGC 660 

ACAGGGTTGC TCTTCAACCT GCCTCCCGGC TCAGTTCACT ATAAGAAACT ATATGTATCT 720 

CGTGGATCTG CCAGTACCAG CCTTCCAAAT GAAACTCTTT CAGAGTTAGA GACACCTGGG 780 

AAATACTCAC TTACACCACC AAACCACTGG GGCCACCCAC ATCGATACCT GCAGCATCTT 840 

TAGTCAAGTT GGAGGAGAAA GACAACACTT GGTCTAAGAC ACGGCAGCAA GACATCCCTG 900 

CATATTGTTC CAGATAAAAA TGAAAGCTGC TCACACCCAC TTGCCTCCCC AATCTGTTAA 960 

ACAGCTTCGT GTCTAGTATG AGCTCAGTAC TTGCCCTGTG AAAATCCCAG AAGCCCCCGC 1020 

TGTCAATGTT CCCCATCCAC ACCCTGCTTG CTCCTGTGTA ACAGCTCAGA TGATGAATAA 1080 
TAATAAAACT GTACTTTTTT GGAAAAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: 37 Protein Sequence 
Protein Accession fh NP_057351.1 

1 11 21 31 41 51 

MEEMKKTAIR LPKGKQKPIK TEWNSRCVLF TYPQGDISSV VDEHFSRALS NIKSPQELTP 60 

PSSQSEGVML KNDDSMSPNQ WRYSSPWTKP QPEVPVTNRA ANCNLHVPGP MAVNQFSPSL 120 

AARRASVRPG ELWHFSSLAG TSSLEPGYSH PFPARHLVPE PQPDGKREPL LSLLQQDRCL 180 

ARRPQESAAR ENGNPGQIAG STGLLFNLPP GSVHYKKLYV SRGSASTSLP NETLSELETP 240 
GKYYSLTPPN HWGHPHRYLQ HL 



Seq ID NO: 3 8 DNA sequence 

Nucleic Acid Accession #: AK05B088.1 

Coding sequence: 252. .1772 



AGGAAACCAA 
GCCCCAGATG 
TAACTCAGAG 
CCTGTGGTGG 
CACTTTCATT 
GCTCAACCAA 
CCCAGATGTG 
TATGCATTTC 
GTGGGGAAGC 
GGCTTTTTCA 
CACACCCCCT 
AAAAATCCCC 
AAGTCGCCTT 
AAGACCTACT 
GGTGCGTGGA 
CCGGCTGCAA 
ACTTGGTTCC 
ACAAAGTGCT 
TGCTTTTGTG 
CCGTGATGAC 
GACGGAGCTT 
ATCTGGAGGT 
TCAACCATTA 
CCCAGGCTGT 
TTGCTCTATA 
GTTTTTACAT 
ACAACCACGT 
TGGTTGACGG 
CCACGTCATC 
TAAGTTTTGC 
CCCCCCACTG 



11 
I 

GGCAAGCTCC 
TGATGAGCGC 
GCCAGTGTGA 
GCAAAAGGGC 
CATGATAATG 
CCGGTGGCGT 
CAAGGCTCTG 
ACAAATCCAG 
AGATGTGATT 
GTCTGAGACC 
GAGGGAGCTG 
TGCAAAGAGG 
CGCCACGGCC 
GGGAGTGCTG 
TGTGATGATA 
GTACAAGGAA 
TCTAGGGGGG 
GAAGTCCCCC 
GGTCTTCTTG 
ATTTTTTAAG 
GAGGCCGCTC 
GCTGCGGCAC 
CCACGCACTG 
GAGATTTGGG 
TGGATTCTTC 
GCAGAGAATA 
CAGCCTGCGA 
CAAGTGGCAG 
CTGCAAAAGC 
ATTCATCTTC 
GTCTGCATAA 



21 
I 

CCCTGTCAAA 
TTCCAGGCTT 
TGGGAGTTCC 
TTTGGAACGG 
TCGAACACGC 
TTGCCCAAAC 
AGCATAGACT 
AAGTTTTTCT 
CTCGAGTGCC 
TTGGCCAAGC 
GAGGAGCTTC 
ATCATCATTT 
CTGAAGAACC 
GCTTCCGCCC 
GCCAGACTCA 
GAGCAGCTCA 
ACGCAGATCC 
AGGTTATTTA 
CAACTGAACT 
AGCTTTCCTG 
TTCCTCTGCT 
CTTAACTTCT 
GAGAATGGGG 
CTGCTCTTTA 
TTTAAGATAA 
AAGCACACAG 
GCGGCACGCC 
GAGTTCAGGA 
CATACCTTGA 
CCAGCATCTT 
AAGAAAATAA 



31 
I 

GCACCTTGGC 
CAGGCTCAGA 
TCCACTCAGC 
TTGCTTGTCT 
ACAAAGCTCG 
AGCCTTTCTC 
TTGAGGAAGC 
TTGAGAATTT 
TGGGCTTCAA 
TCTACCTGAA 
TGCGAGCTCA 
CCTTGAAGAT 
TCTACATGAG 
ACATCCTCCA 
AGCCAAGCAC 
CCACCGGCTG 
ACCTCCACAA 
CCTTTAGTGA 
ACAAGATTCA 
AGAACTGTTG 
TGCGTCTGCA 
TCCCAGAGTC 
GCGACATGGT 
ACCAGGAGAA 
AGGGACTCAA 
ACCTGGAATC 
TGGTGAAGTA 
CAAACCAGAT 
AAATCCAAAC 
GACAGTTTCC 
AATGACATAA 



41 

CCATAAGAAG 
AGGCGCCCCC 
ACACTTCCCC 
TTTCTCTCCT 
GCTGGAACGC 
TGGGGACCTG 
TTTGAGGAAC 
CAAGAACAAG 
ATGGGAGCTC 
AGCCCTGGCG 
ATCACCTAAG 
CAATGACCCA 
TGAGGTGGAG 
GTTCAGTGGC 
CATCAAGAAA 
CGAGAAGTGG 
AATCCCACAG 
ATTCCATCTT 
GGCAATTCCG 
CTTTCTGGAC 
CGGCATCACC 
ATGGCTCGAC 
CCACCTGAAA 
TACAACTTAT 
ACATGATACT 
TCCCTCTGCG 
TGAGATCAGA 
CAAGCAGAAG 
TGTGGGCATC 
AGAAGAATCT 
AAGGGAGC 



51 
I 

AAAAGGGGGA 
AGCTCTCCTG 
TGTAAACACG 
GCGTAATTTC 
CGGGTCACTG 
CTCTCACTTT 
CCAGACAGGT 
G ACAT CCAAA 
CATCAGCCCC 
CAGGGCACCA 
AAGACCAAAG 
CTGGTCACTA 
ATTAACTTGG 
CTGTTTCAAA 
TTCTACGAGG 
CTGGAAATGA 
GACCTGCTCC 
CTGAAAACAA 
ACTTATGAAA 
CGGGACATAG 
AAAGGCAAGG 
CAGGTTACAG 
GATCTTAACA 
TCGAAAACGA 
ACCTCTTATA 
GTCTACGAGC 
GCAGAGGCCC 
TTTGGGTTGA 
CCAATCTATG 
ATGGGATTTT 
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Seq ID NOi 39 Protein Sequence 
Protein Accession ft: BAB7165B.1 

1 11 21 31 41 51 

I I I I I I 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60 

QIQKFFPENF KKKDIQSGEA DVILECLGPK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 120 

REliEBLLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180 

GVIASAHILQ FSGLFQRCVD VMIARLKPST IKKPYEAGCK YKEEQLTTGC EKWLEMNLVP 240 

LGGTQIHLHK IPQDLLHKVL KSPRLETFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVMT 300 

FPKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360 

HALENGGDMV HLKDLNTQAV RFGLLPNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 420 

QRIKHTDLES PSAVYEHNHV SURAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 480 
CKSHTLKIQT VGIPIYVSFA FIFPAS 

Seq ID NO: 40 DNA sequence 

Nucleic Acid Accession 8: NM_0 32899.1 

Coding sequence! 186.. 1070 

1 11 21 31 41 51 

| | | | I I 

GGCACGAGGA TTGCTCAGGA CAGCGGTAAA TCACTTCTTG GAGGTGCCCT GCACGCCGGT 60 

CCTGGGAGCA GGCGaCCTCC CGGGGGTGCG GGAGCCCCAC TCCTCCGTGG TGTGTTCCAT 120 

TTGCTTCCCA CATCTGGAGG AGCTGACGTG CCAGCCTCCC CCAGCACCAC CCAGGGACGG 180 

GAGGCATGAG CCGGTCAAGG CACCTGGGCA AAATCCGGAA GCGTCTGGAA GATGTCAAGA 240 

GCCAGTGGGT CCGGCCAGCC AGGGCTGACT TTAGTGACAA CGAGAGTGCC CGGCTGGCCA 300 

CGGACGCCCT CTTGGATGGG GGTTCTGAAG CCTACTGGCG GGTGCTCAGC CAGGAAGGCG 360 

AGGTGGACTT CTTGTCCTCG GTGGAGGCCC AGTACATCCA GGCCCAGGCC AGGGAGCCCC 420 

CGTGTCCCCC AGACACCCTG GGAGGGGCGG AAGCAGGCCC TAAGGGACTG GACTCCAGCT 480 

CCCTACAGTC CGGCACCTAC TTCCCTGTGG CCTCAGAGGG CAGCGAGCCG GCCCTACTGC 540 

ACAGCTGGGC CTCAGCTGAG AAGCCCTACC TGAAGGAAAA ATCCAGCGCC ACTGTGTACT 600 

TCCAGACCGT CAAGCACAAC AACATCAGAG ACCTCGTCCG CCGCTGCATC ACCCGGACTA 660 

GCCAGAACAT TTCCATCCGG AGTGTGGAAG GAGAGATATA CTGTGCCAAG TCAGGCAGGA 720 

AATTCACTGG CCAAATCCGG GAGAAGTTCA TCATCTCGGA CTGGAGATTT GTCCTGTCTG 780 

GATCTTACAG CTTCACCTGG CTCTGCGGAC ACGTGCACCG GAACATCCTC TCCAAGTTCA 840 

CAGGCCAGGC GGTGGAGCTG TTTGACGAGG AGTTCCGCCA CCTCTACGCC TCCTCCAAGC 900 

CTGTGATGGG CCTGAAGTCC CCGCGGCTGG TCGCCCCCGT CCCGCCCGGA GCAGCCCCGG 960 

CCAATGGCCG CCTTAGCAGC AGCAGTGGCT CCGCCAGTGA CCGCACGTCC TCCAACCCCT 1020 

TCAGCGGCCG CTCGGCAGGC AGCCACCCCG GTACCCGAAC TGACGGCTGA GATGAGGTTA 1080 

GAATGACTGG GCCCGGCTGA ACATTCCAAA TTGGATTTCA CCATCTGCTG AGAAAGTTTA 1140 

AGGAAGGCAA AGCTTGCCAG GTCACAGAAG CTCCCAAGCC CAGCTTTCCA AAGGCCTCAG 1200 

CCTGTGCCTG TGTCGAGCTC AGTCCTGGGA GATAGGGGAG AACCTGCAGG CAGGAACAAG 1260 

CCCCCCTACT CCTGACCACC CTCCATCAGC AGTCTCCCCT CCGTGGTCGT CTTTGTTGAC 1320 

AAAGGTGCAG TTTCTCCTCT CCTGGGCACC TGTAACATGT GATGCGCTGC CTGCTGGGAG 1380 

GTTAGGTCGG GGCTGCCCCG GCGAGTGGAG CATGAGCAGA ACGGCCGAGG ^CACTTCTG 1440 

GGCAGAAGCT TTGAGAGCCT GGGTCCAGGT TGCCACATAG AAGCAGCTCT CGAGTTGAAA 1500 

CCCTCCTCTG CCAGCCTGGG GTCCTAAGCG ATGAGCAGAA TCCCCCACTC CCACCCCACC 1560 

AACCCACAAT GGATATGTAG TGAGCAAGAA ATAAACCTTT GTTGTTTAAA AAAAAAAAAA 1620 
AAAAAAAA 

Seq ID NO: 41 Protein Sequence 
Protein Accession fit NP_1162B8.1 

1 11 21 31 41 51 

Lrsrhlgki rkrledvksq wvrparadfs dnesarlatd alldggseay wrvlsqegev 

DFLSSVEAQY IQAQAREPPC PPDTLGGAEA GPKGLDSSSI* QSGTYFPVAS EGSEPALLHS 
WASAEKPYLK EKSSATVYFQ TVKHNNIRDL VRRCITRTSQ NISIRSVEGE ^CAKSGRKF 
TGQIREKFII SDWRFVLSGS YSFTWLCGHV HRNILSKFTG QAVELFDEEF RHLYASSKPV 
MGLKSPRLVA PVPPGAAPAN GRLSSSSGSA SDRTSSNPFS GRSAGSHPGT RTDG 

Seq ID NO: 42 DNA sequence 
Nucleic Acid Accession 4h NMJJ00782.2 
Coding sequence: 405.. 1946 

1 11 f f i 1 i 1 

icGAGAGGGA CAGGAGGAAA CGCAGCGCCA GCAGCATCTC ATCTACCCTC CTTQACACCT 60 

SSgct ccagcaagcc ctagaggtca gccttgcgga ccaacaggag gactcccagc 12 
£5£ctttt caagaggtac cccagacacc ggccaccctc ttccagcccc tgcggccagt 

GCAAGGAGGC ACCAATGCTC TGAGGCTGTC GCGTGGTGCA GCGTCGAGCA TCCTCGCCOA 
GTCCTTCTGC TGCCTGTCCC GCCTCACCCC GCTCCATCAC ACCAGCTCCC CC^TTTGCX 



60 
120 
180 
240 



?cS^C AGAATCGTTA ISSSSS CCCACTAGCA CCTCGTACCA ACCTCGCCCC 
IcSS CGCGCTCCGG TGTCCCCCGC TGCCATGAGC ^CCCATCA 

GCAAGAGCCG CTCGCTTGCC GCCTTCCTGC AGCAGCTGCG CAGTCCGAGG CAGCCCCCGA 
GACTGGTGAC ATCTACGGCG TACACGTCCC CTCAGCCGCG AGAGGTGCCA GTCTGCCCGC 
SS TGGCGAGACT CAGAAOGCGG CCGCCCTGCC GGGCCCCACC AGCTGGCCAC 
TGCTGGCGAG CCTGCTGCAG ATTCTCTGGA AAGGGGGTCT CAAGAAACAG CACGACACCC 

242 
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PCT/US02/21338 



TGGTGGAGTA CCACAAGAAG TATGGCAAGA TTTTCCGCAT GAAGTTGGGT TCCTTTGAGT 720 

CGGTGCACCT GQQCTCGCCA TGCCTGCTGG AAGCGCTGTA CCGCACOGAG AGCGTACCCC 780 

AGCGGCTGGA GATCAAACGG TGGAAGGCCT ATCGCGACTA CCGCAAAGAA GGCTACGGGC 840 

TGCTGATCCT GGAAGGGGAA GACTGGCAGC GGGTCCGGAG TGCCTTTCAA AAGAAACTAA 900 

5 TGAAACCAGG GGAAGTGATG AAGCTGGACA ACAAAATCAA TGAGGT CTTG GCCGATTTTA 960 

TGGQCAGAAT AGATGAGCTC TGTGATGAAA GAGGCCACGT CGAAGACTTG TACAGCGAAC 1020 

TGAACAAATG GTCGTTTGAA AGTATCTGCC TCGTGTTGTA TGAGAAGAGA TTTGGGCTTC 1080 

TCCAGAAGAA TGCAGGGGAT GAAGCTGTGA ACTTCATCAT GGCCATCAAA ACAATGATGA 1140 

GCACGTTTGG GAGGATGATG GTCACTCCAG TCGAGCTGCA CAAGAGCCTC AACACCAAGG 1200 

10 TCTGGCAGGG ACACACTCTG GCCTGGGACA CCATTTTCAA ATCAGTCAAA GCTTGTATCG 1260 

ACAACCGGTT AGAGAAGTAT TCTCAGCAGC CTAGTGCAGA TTTCCTTTGT GACATTTATC 1320 

ACCAGAATCG GCTTTCAAAG AAAGAATTGT ATGCTGCTGT CACAGAGCTC CAGCTGGCTG 1380 

CGGTGGAAAC GACAGCAAAC AGTCTAATGT GGATTCTCTA CAATTTATCC CGTAATCCCC 1440 

AAGTGCAACA AAAGCTTCTT AAGGAAATTC AAAGTGTATT ACCTGAGAAT CAGAGGCCAC 1500 

15 GGGAGGAAGA TTTGAGGAAT ATGCCGTATT TAAAAGCCTG TCTGAAAGAA TCTATGAGGC 1560 

TTACCCCGGG TGTACCATTT ACAACTCGGA CTCTTGACAA GGCAACAGTT CTGGGTGAAT 1620 

ATGCTTTACC CAAAGGAACA GTGCTCATGC TAAATACCCA GGTGTTGGGA TCCAGTGAAG 1680 

ACAATTTTGA AGATTCAAGT CAGTTTAGAC CTGAACGTTG GCTTCAGGAG AAGGAAAAAA 1740 

TTAATCCTTT TGCGCATCTT CCATTTGGCG TTGGAAAAAG AATGTGCATT GGTCGCCGAT 1800 

20 TAGCAGAGCT TCAACTGCAT TTGGCTCTTT GTTGGATTGT CCGCAAATAC GACATCCAGG 1860 

CCACAGACAA TGAGCCTGTT GAGATGCTAC ACTCAGGCAC CCTGGTGCCC AGCCGGGAAC 1920 

TCCCCATCGC GTTTTGCCAG CGATAATACG CCTCAGATGG TGGTATTTGC TAACATCATA 1980 

TCCAACTCAG GGAAGCGGAC TGAGTGCTGG GATCCAAGGC ATTCTACAGG GTTCACTGCT 2040 

GGTTTACACT TCACCTGTGT CAGCACCATC TTCAGGTGCT TAGAATGGCC TGGGAGCCTG 2100 

25 TTCTGTCTTG CATCTTCCAT GACATGAAAG GGAGGCTGGC ACTTGTCAGT CAGGTAGAGG 2160 

TTACAAACCG TTTCAGGCCC TGCTACCACA TTCACTGTTT GAATCTTTAA TTCCCAAGAA 2220 

TAAGTTTACA TTTCACAATG AATGACCTAC AACAGCTAAA TTTTCTGGGG CTGGGAGTAA 2280 

TACTGACAAT CCATTTACTG TAGCTCTGCT TAATGTACTA CTTAGGAAAA TGTCCCTGCT 2340 

TAATAATGTA AGCCAAGCTA AATGATGGTT AAAGTTATCA GGCCTCCCAT GAAATTGCGT 2400 

30 TCTTCCTGCA TTGAAATAAA AACATTATTG GGAAACTAGA GAACACCTCT ATTTTTAAAA 2460 

GGACTTTAAC GAAGTCAAAC AACTTCTAAG ACTAGTGATT CACTGGGGCA TTATTTGTTA 2520 

GAGGACCTTA AAATTGTTTA TTTTTTAAAT GTGATTCCTT TATGGCATTA GGGTAAAGAT 2580 

GAAGCAATAA TTTTTAAATT GTGTATGTGC ATATGAAGCA CAGACATGCA TGTGTGTGTG 2640 

TGTCTGTGTG TGTGTGTCCG TGTATGTGTG TGTGGGTTCT AATGGTAATT TGCCTCAGTC 2700 

35 ATTTTTTTAA TATTTGCAGT ACTTGATTTA GGATCTGTGG TGCAGGGGAT GTTTCAAAGT 2760 

TTAGTCACAG CTTAAAAACA TTCAGTGTGA CTTTAATATT ATAAAATGAT TTCCCATGCC 2820 

ATAATTTTTC TGTCTATTAA ATGGGACAAG TGTAAAGCAT GCAAAAGTTA GAGATCTGTT 2880 

ATATAACATT TGTTTTGTGA TTTGAACTCC TAGGAAAAAT ATGATTTCAT AAATGTAAAA 2940 

TGCACAGAAA TGCATGCAAT ACTTATAAGA CTTAAAAATT GTGTTTACAG ATGGTTTATT 3000 

40 TGTGCATATT TTTACTACTG CTTTTCCTAA ATGCATACTG TATATAATTC TGTGTATTTG 3060 

ATAAATATTT CTTCCTACAT TATATTTTTA GAATATTTCA GAAATATACA TTTATGTCTT 3120 

TATATTGTAA TAAATATGTA CATATCTAGG TATATGCTTT CTCTCTGCTG TGAAATTATT 3180 

TTTAGAATTA TAATTCACGT CTTGTCATAT TTCATCTGTA TACCTTCAAA TTCTCTGAAA 3240 
GTAAAAATAA AAGTTTTTAA ATATT 



45 



60 



65 



Seq ID NO : 43 Protein Sequence 
Protein Accession ft: NP_000773.1 



1 11 - 21 31 |1 51 

msspisksrs Laflqqlrs PRQPPHLVTB TAYTSPQPRE vpvcpltagg et^naaalpg 

PTSWPLLASL LQILWKGGLK KQHDTLVEYH KKYGKIFRMK LGSFESVHLG SPCLLEALYR 
TESVPQRLEI KPWKAYRDYR KEGYGLLILE GEDWQRVRSA FQKKLMKPGE VMKl^NKINE 
VLADFMGRID ELCDERGHVE DLYSELNKWS FESICLVLYE KRFGLLQKNA GDEAVNFIMA 
55 IKTMMSTFGR MMVTPVELHK SLNTKVWQGH TLAWDTIFKS VKACIDNRLE KYSQQPSADF 

LCDIYHQNRL SKKELYAAVT ELQLAAVETT ANSLMWILYN LSRNPQVQQK U^IQ^ 
ENQRPREEDI* RNMPYLKACL KESMRLTPGV PFTTRTLDKA TVLGEYALPK GTVimNTQV 420 
LGSSEDNFED SSQFRPERWL QEKEKINPFA HLPFGVGKRM CIGRRLAELQ LHLALCWIVR 480 
KYDIQATDNE PVEMLHSGTL VPSRELPIAF CQR 



Seq ID NO: 44 DIJA sequence 

Nucleic Acid Accession ft: NM_007000.2 

Coding sequence: 1-777 
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1 11 21 31 41 51 

ItGGCGTCTG CGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GOQCCTGCTA 
GTTGTGGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA ^CATATGG 
GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 
70 TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 

GGTGTGGGTG CCGCACTCTG CCGCCGCCGG TCCATGGTCC TCACGTACCT GGTGCTCATG 300 
CTCATCGTCT ACATCTTCGA GTGCGCCTCC TGCATCACGT CCTACACCCA CCGTGACTAC 360 
ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 42 0 
GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTGT 
75 GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCOG 
GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA CGGGAAACTT CATCCCCCTC 
AACGAGGAGG GCTGCCGCCT GGGGCACATG GACTACCTGT TCACCAAGGG CTGCTTCGAA 
CACATCGGCC ACGCCATCGA CAGCTACACG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 

243 
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CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGAGGG 780 . 
ACAGGAGGGG AAGGCAACAT ACACACCCCG GACTCCTCOG CATCCTCCTC CTGCTTCCTC 840 
CGCTGGGCCT GGATGGCTGC CTCACCTCTC ACCTCCCAAC GTCCCTAGCC CTTACGTCCT 900 
TCCACTTCCA AGATCTTTTT CCAGGTTCCT GAGCCCTACT GTGTCTCAGG TGTGCCCTGA 960 

AACCCCAGGG CTTGTGTGCA CATATCCTTA GCCCATCTTT CAAGGGACCT CTCCATGATC 1020 

CCACCTCCCA TTCACAGATA CCTCTCTTGT AGCTCTCTGA CCTCCTCCTT CATGGCAGGC 1080 

ATOGCCATTC TTGCTGAACC GTTTGTGATT GCCATTTGAG CTCTGGAAGC CTCTATTGCC 1140 * 

ATGAGAGTTC TGTCACGGTC ACTTTACTGT CCCCATCATC ACCCAGCACG GGGCTAAGCA 1200 
TATACTAGAT AGTCAATAAA TAA 



seq ID NO: 45 Protein sequence: 
Protein Acceasion ft: NPJJ08931.1 



1 11 21 31 41 51 

I I I I I I 

KASAAAAEAE KGSPVWGLL WGNIIILLS GLSLFAETIW VTADQYRVYP LMGVSGKDDV 60 
FAGAWIAIFC GPSFFMVASP GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 120 
MVSNPSLITK QMLTFYSADT DQGQEVTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 180 
EWFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWFGFAI 240 
LMWTLPVMLI AMYFYTMIi 



Seq ID NO: 46 DNA sequence 

Nucleic Acid Accession ft: NM_006760.1 

Coding sequence : 3 9 . . 593 

1 11 21 31 41 51 

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC 60 

GGACCTTGCC CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCAGGGGCT GCAGACTTCA 120 

ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 180 

TGCCCCCCTG TCACCTCACA GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 240 

GCAAAGTGGT GACGTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 300 

GTGTGGTGGA CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 360 

ACCTCGTGCC AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 420 

AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 480 

TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT 540 

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 600 

TGCCCGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 660 

GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 720 

CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 780 

ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 840 

CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 900 
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA 



Seq ID NO « 47 Protein Sequence 
Protein Accession II: NPJJ06751.1 

1 11 21 

I I I 

MAPLLPIRTL PLILILLALI* SPGAADFNIS 
MVRRANDSKV VTSSFWPPC RGRRELVSW 
VKKGTATESS REIPMSTLPR RNMES IGLGM 
GSRK 



31 41 51 

1 I 1 

SLSGLLSPAL TESLLVAIjPP CHLTGGNATL 60 

DSGAGFTVTR LSAYQVTNLV PGTKFYISYL 120 

ARTGGMWIT VLLSVAMFLL VLGFIIAIAL 180 



Seq ID NO: 48 DNA sequence 

Nucleic Acid Accession ft: EOS sequence 

Coding sequence: 200-2932 

1 11 21 31 41 51 

ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 
ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120 
TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 
GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 
CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 
CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 
GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 
TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG 
CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 
CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 600 
ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 660 
CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 720 
CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 780 
TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 840 
TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 900 
TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960 
AGCCCAGTCT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 1020 
CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 

244 



180 
240 
300 
360 
420 
480 
540 



1020 
1060 



WO 03/003906 



CATCAGGGAG 
TGTAGGCAAT 
TCGGCAAAAC 
TATTTCATCT 
CATCAGTATA 
GCGGGAAGAA 
GGTGCCTCCG 
TCCAGTGAAC 
AAATACATCT 
CCTTCCAGAA 
CAAAAATGGA 
CATAAATGAA 
TGTGTTTTGG 
AACTCAAGAC 
ACCTTTTGTC 
TATCTCCATT 
TAAAAAAAGC 
CTTGATTGCT 
TGGAGTCTGC 
GATGCTCATG 
CCAGCATTTG 
TGTCATTACC 
GCTTAACTGG 
TGTGGCTGTG 
TGGGGAAAGA 
CATTCTGACC 
CCAGAATCTG 
CTTATGCTTT 
TGCCTTAAGT 
CAAATTCTCA 
TGGAGATTCC 
ATCATAAAAT 
AGAAATTATG 
AATGTCTCAG 



ACTTGTGTGC 
GCCACTGAGG 
CCATCAACCA 
CTGTCACTGG 
GCTGACAATA 
AAGTATGCCA 
ACAGCTCTTC 
AAAAGCCAAC 
ATTCCCATCA 
ACTATTATCA 
AATGCTCAGG 
GTTTTCCTAT 
GATTTCAGTC 
ATCGTGACGT 
CCCTCTACAA 
GGAAGTCTCA 
CAAACCTCTC 
GATGTCTGGT 
ACAGCTGCTG 
CTTGGCATCC 
ATGATGGCTG 
ATTGCTGTCA 
TCCAATGGAA 
AACTTCGTTG 
CTGAGTCGGG 
CCTCTGCTAG 
GCTTGGCATG 
GGAATACTCT 
TCTTGGAAGC 
AAGCCTTTCA 
TCCGACAACA 
CAAGAAAAAA 
CTCAGTATTC 
TGGCTTCA 



TCTCTCTGCT 
CAGCTGTGTC 
CAGTGGGGAA 
CCAGCCATTT 
TCCTTAATTC 
GCTCACGGTT 
CTCTGAATTT 
TCAAAAGGGG 
GAGGCOGTGT 
GCATGGCCTC 
TCAATGGACC 
TTTTTTCCAA 
ATTTGCAGTG 
GCCAATGTAC 
TCTTCCCCGT 
TTTTATGCCT 
ACACACGTCG 
TTATTGTTGG 
TGTTCTTTAC 
TGCTGGCTTA 
TTGGATTTTG 
CGCAACCTAG 
GCAAACCACT 
TGGTGCTGCT 
ATGACAAGGC 
GGCTCACCTG 
TTATTTTTGC 
TGGACAGTAA 
AAACAGAAAA 
ACCCACTGCA 
TCATGCTAAC 
TTTCCAGAAC 
GATCGGGTTT 



TGAAGAACTG 
ATCCTTCGTG 
TCTGGCTTCG 
CAGGGTGTCC 
AGCCTCAGTA 
ACTAGAGACA 
TTCTCGGAAA 
TTACAGCTAT 
GTTAATTGGG 
GTTGACTCTG 
TGTGATATCC 
GATAGAGTCA 
GAACGATGCA 
TCACTTGACC 
TGTAAAATGG 
GATCATCGAG 
TATTTGCATG 
TGCCACAGTG 
ACACTTCTTC 
CCGGATCATC 
CCTGGGTTAT 
CAATACCTAC 
CCTGGCTTTT 
AGTTCTCACA 
CACCATCATC 
GGGCTTTGGA 
TTTACTCAAT 
GCTGCGACAA 
GCAAAACTCA 
AAACAAAGGC 
TCAGTTTGTC 
AACTTGACAT 
TCTGATTTAG 




Seq ID NO: 4 9 Protein sequence: 
Protein Accession ft: EOS sequence 



MKVGVLWLIS 
EKRDLRNFLK 
NCYLHTAGAL 
IEIQLKKAYE 
LFPLEDGSFR 
LSLLEELNKN 
ASHFRVSNST 
PLNFSRKFID 
SMASLTLGNI 
HLQWNDAGCH 
ILCLIIEALF 
VFFTHFFYLS 
TQPSNTYKRK 
DDKATIIRVG 
LDSKLRQLLF 
IMbTQFVSNE 



11 
I 

FFTFTDGHGG 
LLKPPLLWSH 
PSCECHLNNL 
RIQGFESVQV 
VFGKAQCNDI 
FSMIVGNATE 
MEDVISIADN 
WKGIPVNKSQ 
LPVSKNGNAQ 
LVNETQDIVT 
WKQIKKSOTS 
LFFWMLMLGI 
DVCWLNWSNG 
KSLLILTPLL 
NKLSALSSWK 



21 
i 

FLGKNDGIKT 
GLIRIIRAKA 
SQSVNFCERT 
TQFRNGSIVA 
VFGFGSKDDE 
AAVSSFVQNL 
ILNSASVTNW 
LKRGYSYQIK 
VNGPVISTVI 
CQCTHLTSFS 
HTRRICMVNI 
LLAYRIILVF 
SKPLLAFWP 
GLTWGFGIGT 
QTEKQNSSDL 



31 
I 

KKELIVNKKK 
TTDCNSLNGV 
KIWGTFKINE 
GYEWGSSSA 
YTLPCSSGYR 
SVI1RQNPST 
TVLLREEKYA 
MCPQNTSIPI 
QNYSINEVFL 
ILMSPFVPST 
ALSLLIADVW 
HHMAQHLMMA 
ALAIVAVNFV 
IVDSQNLAWH 
SAKPKFSKPF 



41 
I 

HLGPVEEYQL 
LQCTCEDSYT 
RFTNDLLNSS 
SELLSAIEHV 
GNITAKCESS 
TVGNIiASWS 
SSRLLETLEN 
RGRVLIGSDQ 
FFSKIESNLS 
IFPWKWITY 
FIVGATVDTT 
VGFCLGYGCP 
WLLVLTKLW 
VIFALLNAFQ 
NPLQNKGHYA 



SI 
I 

LLQVTYRDSK 
WFPPSCLDPQ 
SAIYSKYANG 
AEKAKTALHK 
GWQVIRETCV 
ILSNISSLSL 
ISTLVPPTAL 
FQRSLPETII 
QPHCVFWDPS 
VGLGISIGSL 
VNPSGVCTAA 
LIISVITIAV 
RPTVGERLSR 
GFFILCFGIL 



60 
120 

ieo 

240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 50 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 63-3224 



1 
I 

CAGGAATTTG 
CCATGTCGAG 
AGCAGCAGCC 
CGGGGGAGCT 
AAGCCACAGT 
AGTTCTTCAG 
TCCTCATCAT 
TGAGCCACCA 
CAGAGGACAT 
CCCTGCCACC 
ATGTGACCTT 
CACTCCCTGC 
GTCTGCAGCA 
ACATGTGGGC 
TGGGCAGCCA 



11 
I 

TGGCGGAGAG 
GCGCAAGCAG 
GCAQCAQCAG 
GGGTGCTCCA 
AAAGCGGCTT 
CATCTCTGAG 
GAATGACAGC 
GCCCACCAGT 
GAAGGAQAAG 
CACCCCCCAG 
GCAGGCACTA 
CCCCGTGCCT 
GCAGCAGCTA 
CTCCCACGCC 
CATGTCTCAG 



21 
I 

GGCAAATAAC 
GCGAAACCCC 
ACCCCGGAGT 
GTGAACCACC 
CGTCGGGAGG 
TTCCTGGAAC 
GAGGGGCCTG 
CCCGGCAGTA 
CCGGATGCGG 
GACATAAGCT 
CGGGGCACCA 
GGTGCCAACA 
CAGCAGATCC 
CTCCACTCAA 
CAGGTTTCTG 



31 
1 

TGCGGCTCTC 
AGCACATCAA 
TTGCAGATGC 
CAGGGAATGA 
AGACGCACGT 
ATAAGAAAAA 
TGCCTTCAGA 
AGGACTGTCA 
AGTCTGTGGT 
ATTTAGCCAA 
AGGTGGCGGT 
GCATCCCGTG 
AGCTCACCGA 
GCGGGGCAGG 
CAGCTGTGGC 



41 
I 

CCGGCGCCCC 
CTCGGAGGAG 
GGCCCCAGCG 
CGAGGTGGCG 
CTGTGAGAAA 
TTGCACTAAA 
AGACTTCTCC 
CAGGGAGAAT 
GTACCTAAAG 
AGGCAAAGTG 
GAATCAGCGG 
GGTCCTCGAG 
GCAGATCCGC 
GGCCGACACT 
TTTGCTCAGC 



51 
I 

GATGCTCGCA 
GACCAGGGCG 
GCGCCCGCGG 
AGTGAGGATG 
TGCTGTGCGG 
AATCCACCTG 
GGAGCTGTAC 
GGCGGCAGCT 
ACAGAGACAG 
GCCAACACTA 
AGCGCGGATG 
CAGATCTTGT 
ATCCAGGTGA 
CTGAAGACCT 
CAGAAAGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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GAAGCCAAGG 
CTTCTGCCAC 
CCCGGGTGCT 
GCTCGGTGCT 
AGGGGAAGCC 
ACAAGCACAA 
ACCTCCGCTC 
CCACCAAGGG 
AGCTGTTTGC 
CTGTACCTGA 
CCACCTCTGT 
GTGGCTCCTT 
CACTCCCTGG 
GGACCCCTGA 
AGGCCACCAC 
CCCTCAAGAT 
GCCGAGCCTT 
CATCCATTAA 
TGCTGCAGCA 
AGAATCCCTG 
GCGCTATCTG 
AGGCTCCCAG 
CCACGCTAGG 
CTTTTAACCT 
CCAACGACTC 
TGGAAACCAC 
CAAAGTCTCC 
AAGGTCGGAG 
AAGTTCCTGG 
CAGCCCAGCC 
CGTCTGCTAG 
GCAACATTTG 
ACGGGGCGAA 
CTCTGTTAGG 
CTTCAGTGAA 
TGGCCGTGAA 
TTTCCTTGGG 
AGTCGGGTAT 
AGTTTCCTCA 
AAGGAGCAAT 
TCCTCCTGTT 
ACAACCTCAG 



TCTGTCTCTG 
CAGCTCCCTG 
CCCGAACGTC 
CTTCCAGAGC 
ACCGAACATC 
GTGTAAGTAC 
CCACACTGGA 
CAACCTCAAG 
COAGTTCCAG 
CCCCATAGAT 
AGGGCTACCT 
GCCOGGTGAC 
GGTGGGACCA 
GCCAGGGTCA 
TGATCCCAAC 
GCATTATCGC 
TTCTACCAAA 
GACGCAGCAT 
ACATATTCGG 
TGACTTTACG 
CCATGATGAT 
CAGCTCCTCC 
GTTTGCCATG 
GCAGCGCCAG 
ATCCTCGCTG 
ATCCTTCCAG 
CGATGCTGGG 
CAGTCTCCCT 
CACATTTGTG 
ACGCCGACAG 
CGCTCTTCAG 
TGGGCGAGCT 
CAATAACTCA 
TACGGACGGA 
TGTGGACCCT 
GACCAATGAG 
GGCCACCTCC 
CAGTGCAGAT 
CTTCCTGGAA 
GCAGACACAG 
TTCTTTTTCT 
GCAAGTGCTA 



GATGCCTTGA 
TCCCCAGGGC 
ATGTCCCGCC 
CCTTTCTCCA 
TCCGCGGTGG 
TGTAGCAAGG 
GAGAGACCCT 
GTGCACTTTC 
GACAAAGTGG 
GAACCGAGTC 
CAGAATCTTT 
CTGCAGCCTG 
AACTATAATT 
GAGACCCTGA 
GAATGTCTCA 
ACCCACACCG 
GGTAACCTGA 
TCGTGCCCCA 
ATGCACATGG 
GGTTCTGAGC 
GTCATCGAAA 
AAGGTCCCCA 
ATGGCTTCCT 
GGCAGCAGAG 
ATGGGAGACC 
GCACTCTCCC 
AGCAAAGCAG 
TCCAQGTTTA 
GGACCCTCGA 
GCCAAGCAAC 
ATCCACGAGC 
TTTACCACCA 
GCCCGCCGTG 
AAAAGAGTCT 
GTTGTGTGGA 
ATCTCTGTGA 
GTTGTGAATA 
GTGGAAAAAC 
GAAAACAAGA 
TGAAATCTCT 
TACTGATATG 
CAATCACGAT 



AACAAGCCAA 
TGGCACCCTT 
TCCCGAGCGC 
CTGTGGCGCT 
ATGTCAAACC 
TTTTTGGOAC 
TCGTGTGCTC 
ACCGACATCC 
CGGCCGGCAA 
TTTCTTTAGA 
CTTCGGGGAC 
GGCCTTCTCC 
CCCCAAGGGC 
AATTGCAGCA 
TTTGCCACCG 
GGGAGAGACC 
AGACACACCT 
TCTGCCAGAA 
GCGGTCAGAT 
CAATGACCGT 
GCATCGATGT 
CGCCTCTTCC 
TAGATGCCCC 
AAAACGGTTC 
AGGAGTATCA 
CGGCCAATAG 
AGAGCTCCGA 
TCCGAGCCCC 
CATTGTCCCC 
ATGGCTGCAC 
GGACTCACAC 
AAGGCAACTT 
GAAGGAAGTT 
CAGAAATCTT 
ACCAGTACAC 
TCCAGAGTGG 
ACGCCACTGT 
CAAGTGCTAC 
TTGCGGTCAG 
AGAATCTGCT 
CAAATGATGT 
TGTTGCTATG 



GCTACCTCAC 
CACTCTGAAG 
TTTGCTTCCT 
AGACACATCC 
CAAAGACGAG 
TGATAGCTCC 
TGTCTGTGGT 
CCAGGTGAAG 
TGGCATCCCC 
CAGCAAACCT 
TAATCCCAAG 
AGAAAGTGAG 
TGGTGGCTTC 
GTTGGTGGAG 
AGTCTTAAGC 
GTTCCAGTGT 
TGGGGTTCAC 
GAAGTTCACT 
TCCCAACACG 
GGGTGAGAAC 
AGAGGAAGTC 
CAGCATCCAC 
AGGGAAAGTG 
CGTGGAGAGC 
GAGCCGAAGC 
TCAAGCCGAA 
GAACAGCCGC 
GCCGACCTAT 
AGGGATGACC 
ACGGTGTGGG 
TGGAGAGAAG 
AAAGGTTCAC 
GGCCATCGAG 
TCCCAAGGAA 
CAGCATGCTC 
GGGGGTTCCT 
CTCCAAGATG 
TGACGGCGTT 
CTAAGGGAGA 
TTGTTTTGTA 
TTACTACGTT 
CTGCTTTGCA 



GCCAACATCC 
CCGGATGGGA 
CAGGCCCCGG 
AAGAAAGGGA 
GCGGCCCTCT 
TTGCAGATCC 
CATCGCTTCA 
GCAAACCCCC 
TATGCACTCT 
GTCCTTGTAA 
GACCTCACGG 
GGTGGACCCA 
CAAGGGAGTG 
AACATTGACA 
TGTCAGAGCT 
AAGATCTGTG 
CGAACCAACA 
AATGCCGTGA 
CCCCTGCCAG 
GGCAGCACCG 
AGCTCCCAGG 
TCGGCATCAC 
GGTCCTGCCC 
GATGGCTTGA 
CCAGATATCC 
AGCATCAAGT 
ACTGAGATGG 
GTCAAGGTTG 
CCTTTGTTAG 
AAGAACTTCT 
CCTTTTGTGT 
TACATGACAC 
AACACCATGG 
ATCCTGGCCC 
AATGGCGGTC 
ACCCTCCCGG 
GATGGCTCCC 
CCCAAACACC 
ACTTGCGTGG 
AGAACTCATC 
GGTTGTGACC 
AAAAGTTG 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1SO0 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



Seq ID NOt 51 Protein sequence: 
Protein Accession #: NP_0 65169.1 



1 

MSRRKQAKPQ 
ATVKRLRREE 
SHQPTSPGSK 
VTLQALRGTK 
MWASHALHSS 
SATSSLSPGL 
GKPPNISAVD 
TKGNLKVHFH 
TSVGLPQNLS 
TPEPGSETLK 
RAFSTKGNLK 
NPCDFTGSEP 
TLGPAMMASL 
ETTSFQALSP 
VPGTFVGPST 
NICGRAFTTK 
SVNVDPWWN 
SG1SADVEKP 



11 
I 

HINSEEDQGE 
TKVCEKCCAE 
DCHRENGGSS 
VAVNQRSADA 
GAGADTLKTL 
APPTLKPDGT 
VKPKDEAALY 
RHPQVKANPQ 
SGTNPKDLTG 
LQQLVENIDK 
THLGVHRTNT 
MTVGENGSTG 
DAPGKVGPAP 
ANSQAESIKS 
LSPGMTPLLA 
GNLKVHYMTH 
QYTSMUlGGL 
SATDGVPKHQ 



21 
I 

QQPQQQTPEF 
FFSISEFLEH 
EDMKEKPDAE 
LPAPVPGANS 
GSHMSQQVSA 
RVLPNVMSRL 
KHKCKYCSKV 
LFAEFQDKVA 
GSLPGDLQPG 
ATTDPNECLI 
SIKTQHSCPI 
AICHDDVIES 
FNLQRQGSRE 
KSPDAGSKAE 
AQPRRQAKQH 
GANNNSARRG 
AVKTNEISVI 
FPHFLEENKI 



31 
I 

ADAAPAAPAA 
KKNCTKNPPV 
SWYLKTETA 
IPWVLEQILC 
AVALLSQKAG 
PSALLPQAPG 
FGTDSSLQIH 
AGNGIPYALS 
PSPBSEGGPT 
CHRVLSCQSS 
CQKKFTNAVM 
IDVEEVSSQE 
KGSVESDGLT 
SSENSRTEME 
GCTRCGKNFS 
RKLAIENTMA 
QSGGVPTLPV 
AVS 



41 

I 

GELGAPVNHP 
LIMNDSEGPV 
LPPTPQDISY 
LQQQQLQQXQ 
SQGLSLDALK 
SVLFQSPFST 
LRSHTGERPF 
VPDPIDEPSL 
LPGVGPNYNS 
LKMHYRTHTG 
LQQHIRMHMG 
APSSSSKVPT 
NDSSSLMGDQ 
GRSSLPSTFI 
SASALQ I HER 
LLGTDGKRVS 
SLGATSWNN 



Seq ID NO: 52 DNA sequence 

Nucleic Acid Accession 8: CAT cluster 



i 11 21 

I I I 

TCTAGCATTA AAGGAAGAAA CCCCGTCTTG 

AACTTGCAGA CATAACAAGC AGAGTGTTTC 
TCTGTGAGTT GAAGGCACAC ATCACAAAGT 
ATTTGAAGAT ATTTCCTTTT CTACTGTTGG 
ATTCCACAAA AAGAGTGTTT CAAATCTGCT 
TQAATACACA CAACACAAGG AAGTTACTGA 



51 
I 

GNDEVASEDE 
PSEDFSGAVL 
LAKGKVANTN 
LTEQIRIQVN 
QAKLPHANIP 
VALDTSKKGK 
VCSVCGHRFT 
SLDSKPVLVT 
PRAGGFQGSG 
ERPFQCKICG 
GQIPNTPLPE 
PLPSIHSASP 
EYQSRSPDIL 
RAPPTYVKVE 
THTGEKPFVC 
EIFPKEILAP 
ATVSKMDGSQ 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 



31 41 

i I I 

CAACGAAGGC CTCAAAGAGG TCCAAATATC 
TAAACTGCTC TAAGAAAAGA AAGGTTAAAC 
AGTTTCTGAG AATGATTCTG TCTAGTTTTT 
CATCAAATCG CTTGAAATCT CCACTTGCAA 
CTGTGTAAAG GGACGTTCCA CTCTGTGAGT 
GAATTCTTCT GTCTAGCATG ATATGAAGAA 



60 

120 

180 

240 

300 

360 



246 
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ATCCCGTTTC CAACGAAGGC CTCAATGAGG TCTATATATC CACTTGCAGA CTTTACAAAC 420 

AGAGTGTTTC CTAACTQCTC TATGAAAAGA AAGGTTAAAC TCTGTGGAGT TGAACGCACA 480 

CATCACAAAA GGAGTTTCTG AGAATCATTC TGTCTAGTTT TTTTAGGAAG ATATTTCCTT 540 
TTCTACCGTT GACTTCAAAG CGGCTGAAAT 



Saq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NMJ)01910.1 

Coding sequence! 50.. 1240 

1 ll 21 31 41 51 

| | I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GOACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACXTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC rACATGAGCA GTAACCCAGA 720 

AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCA7ATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTCG 1920 
TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 
A^GATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 
CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO: 54 Protein Sequence 
Protein Accession 8: NP_00 1901.1 



1 11 I 1 j 1 J 1 f 

Jktllllllv Lelgeaqgs lhrvplrrhp slkkklrars qlsefwkshn ldmiqftesc 

SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRP 
GPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 
FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 
FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 
IGAAPVDGEY AVECANLNVM PDVTPTINGV PYTLSPTAYT LLDPVDGMOF CSSGFQGLDI 
HPPAGPLWIL GDVFIRQFYS VFDRGNNRVG LAPAVP 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession NM_018058.1 

Coding sequence: 319.. 1575 
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1 11 r i 1 r r 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 



TAORCnCTGC iKaGAttAiw*-/*. uw\jwu-»wv»\.>- j-» «. wwwww » ~- • — 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 
xtScCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 
GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 
AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC . 
CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 
SSg^gatg tSctgctga GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 
^CCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 
AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 
G^CGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 
A^AGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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ACCCATGGGA AGGTCCGCTT CCGGGACATC 
GTCCGCACGG TCATCACCGC OGACTTTGAC 
AACATTGCCT ACCGCAGCTC CTCAGCCAAC 
GGAGACCCCC TCATCGAGGA GCTCAATCCC 
5 ACAGGGGGTG TGGTGACCGA CTTCOAOGGA 
GGACAGTCCA TGGCTCAGCC GCTGTCOGTC 
TGGCTGCGAG TGGTGCCACG CACCCGGGTT 
CTCTACACCA AGAAGAGTGG GGCCCACCTG 
TGTGAGATGG AGCCCGTGGC ACACTTTGGC 

10 GTGACGTGGC CAGATGGCAA GATGGTGAGC 
GTGCTGGAGA TCCTCTACCC CCGGGATGAG 
ACACCAATGA ATGCATCCAG TTCCCATTCG 
ACACCTATGG AAGCTACAGG TGCCGGACCA 
ACGAGGATGG CACAGCCTGC GTGGGGACTC 

15 CCCCCACCGC TGCTGCTGCC ACTGCCGCTG 
CACCGGTCCT CGTAGATGGA GATCTCAATC 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA 
CCCAAGCCCA TCCATGCACA TTACTTAGCT 

20 CTGTGCTGGG CACATAGCTG TGATCACAGC 
ATTCCAGTGG GTCTAATGAC CATATCTTAG 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC 
AAAGCTATGT GACCTTACAC CAGTCACTTA 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT 

25 GACACTTGGC ACAAAACCTG GCACATAGTA 
GGGCTTTGTC AACACGTG 



GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

CGCCTCTTCC GCGTCATCCG TAGAGAGCAC 840 

GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 

AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 

GACACACTTC AGGACCCAGC CCCACTGGAO 1320 

TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

AACAATTAGG GAGACTOGTA AGGCCAGGCC 1800 

AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

ACTTGTTAGC CAT CCATT AT CGCATCTGCA 2040 

TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 



30 



Seq ID NO: 56 Protein Sequence 
Protein Accession #: NP_060528.1 



1 11 21 31 41 51 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPIIiS SSASDIFCDN ENGPNFLFHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 

35 FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPLIE 180 

ELNPGDALEP EGRGTGGWT DFDGDGMLDI. ILSHGESMAQ PLSVFRGNQG FNNNWLRWP 240 

RTRVGAFARG AKWLYTKKS GAHLiRI IDGG SGYLCEMEPV AHFGIjGKDEA SSVEVTWPDG 300 

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 3 60 
GAGPTRSAVG ATSPTRMAQP AWGliSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



40 



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession ft: AJ279016.1 

Coding sequence: 1..1962 



45 i ii 2i y i 1 r 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 
CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TOACTATGAC 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

50 TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG "CT^AAGTA ^ACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA OGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GOGATGAGGT CAACGTGGCC 480 

55 CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT S40 
GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC ^TGCCCTC 



60 
120 



540 
600 



GGACGCTACT CTATCTMtnl i\jLuu\i in«. U v.i»j.»"-"w». - — -~ 

ATTGAAATGO ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATCTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCKATC 720 

CTCAGCAGCA QTGCCTCGGA TATCTTCTCC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

60 CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATQAGCAC CCATCGGAAG 960 

ScCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGI CCGCACGGTC 1020 

aTcACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 

65 ScTCCT CAGCCAACCG CCTCTTCOGC GTCATCCGTA GAGAGCAOGG AGACCCCCTC 1140 

Sggagc TCAATCCCGG CGACGCCTTG GAGCCTQAGG GCCGGGGCAC AGGGGGTGTG 1200 

G^CGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

^CCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

70 A^GAGTg5gG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

^vSr^rr^C ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

SSSa £g££Sg GAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1560 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

SScCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

75 OTGTGCCCTC QAGACAAGCC CGTATGTGTC AACACCTATO GAAGCTACAG GTGCCGGACC 1740 

5 AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 
CTCGGCCAGT CACCGQGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCCCT 1860 
gSgItcCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 

248 
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CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTQCT GAGCAGGGGT GGGACATGAA 1980 

CCAGCGGATQ GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCQT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

5 CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATGTA AGAC ACTTG G CACAAAACCT GGCACATAGT 2460 
10 AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 



Seq ID NO: 58 Protein Sequence 
Protein Accession 8: CAC08451.1 

15 1 11 21 

I I 1 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV 
FEIWAGYNG PNLVLKYDRA QKRLVNIAVD 
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE 

20 GRYSIYIANY AYGNVGPDAL IEMDPEASDL 
LSSSASDIFC DNENGPNPLF HNRGDGTFVD 
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK 
RSSSANRLFR VIRREHGDPL IEELNPGDAL 
AQPLSVFRGN QGPNNNWLRV VPRTRFGAFA 

25 PVAHFGLGKD EASSVEVTWP DGKMVSRKVA 
FSQQENGKCM DTNECIQFPF VCPRDKPVCV 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA 



31 41 51 

I I I 

TNSVLPPDYD SNPTQIiNYGV AVTDVDHDGD 60 

ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 120 

DILSDEVNVA RGVASLFAGR SVACVDRKGS 180 

SRGILALRDV AAEAGVSKYT GGRGVSVGPI 240 

AAASAGVDDP HQHGRGVALA DFNRDGKVDI 300 

FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 360 

EPEGRGTGGV VTDFDGDGML DLILSHGESM 420 

RGAKWLYTK KSGAHLRIID GGSGYLCEME 480 

SGEMNSVLEI LYPRDSDTLQ DPAPLECGQG 540 

NTYGSYRCRT NKKCSRGYEP NEDGTACVGT 600 
APVtiVDGDLN LGSWKESCE PSC 
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Seq ID NO: 59 DNA sequence 
Nucleic Acid Accession ft: 
Coding sequence : 1 . . 4794 



I 



ll 
I 



21 



31 



41 



51 



ATGGCGTGTC 
AGCGGCTCCT 
GTTCTGAAGT 
TCACCCTACT 
ATCGACGGGG 
CACAGCAGCT 
CCACCTACAA 
TCCTCCCTGG 
TGTCGGGGTG 
GGGGTGGCCA 
CTGAGCGATG 
GCCTGTGTGG 
GGTAATGTGG 
GGCATTCTGG 
TTCTCCCACA 
GGAGGAGACC 
TGCCGGCTGG 
CAGAGGGAGG 
TCCAAAAGCC 
GCGCCTTCTC 
CCCCTTGTCA 
CCCCACCCCC 
CTGATGGCTG 
CTGAGAAGCT 
GAGCTGGGAG 
CTGGGAGAAC 
CCCAAGGTCA 
GGCCCCGGGA 
CTCTCCCATC 
GTGCCGGGAG 
CTGGCGTGGA 
TTTAGGCTCA 
CTGCAGTTCC 
TCTGCCACTC 
ATCCTCAGCA 
TTCCACAACC 
GCCTTCATOG 

ctagcagaaa 
ccacattgcc 
ttcttgacgc 

CAGGGGGCCC 
ACTGCCTATT 
TTGTCCTCTG 
GCCCTGGCTG 



CGGGAGGACT 
CCCCAGCATC 
ATGACCGGGC 
ACGCGCTGCG 
ACGGCCGGGA 
CAGCGCAGGT 
CCCCTGCAGG 
GTCAGGCTTC 
GACTGAGACC 
CGTACACCGA 
AGGTCAACGT 
ACAGAAAGGG 
GCCCTGATGC 
CGCTCAGAGA 
CTGCCTCTCC 
CAGAGGAGGC 
GCTGGAAGGA 
CTGGGGCAGC 
ATTTGGCTGA 
CAGCCCACCC 
CTCAGCTAAT 
GAGCCCCAGG 
AGGCTTTGGG 
GGGAGGAAAG 
GTCCCTGGAG 
CTCCCATTTT 
CACAGGAGTG 
GGGTGGCCAA 
CCCTGGTCCC 
CTGCCCTGCC 
ACCAGATGGA 
GGAAAGCACG 
CCTCAGGCCT 
ACTGTGGGTC 
GCAGTGCCTC 
GGGGCGATGG 
TTCACCTCAA 
CTGGTCCTTC 
ATCATGGTTT 
AAGGCTTGGC 
CACCCTGCCT 
ACATTGTCCT 
AAAGAGTCAA 
ACTTCAACCG 



CCCAGCCCGT 
CCCTCCCCAT 
CCAGAAGCGG 
GGACCGGCAG 
GGAGATCTAC 
CCCTTCTGGG 
CCTCCTGGGT 
TCCGGACAGC 
TACCCATGAA 
CAAGTTGTTC 
GGCCCGTGGT 
CTCTGGACGC 
CCTCATTGAA 
TGTGGCTGCT 
AAGCATTGGT 
AGATGAGGAG 
CGGGCAGTTC 
TGGCGTGCCC 
CAAGAACCTA 
TTTCCCTGCC 
GACACATGGA 
AATGGACCCC 
CGCGTGGCCA 
CAGGCAGAAG 
CCAAGCCACA 
ACAAAGAACA 
CCATCTAGTG 
GCGAGAGATT 
CAACTTCCCC 
TGGGAATCCT 
AAAAGAGGAG 
GGAAGCAGAA 
CAGAGGCAGC 
GATGTCTTTT 
GGATATCTTC 
CACCTTTGTG 
ATATCACCTC 
CTCCTCCTGC 
GTCTATGAGC 
CTCCAGTGCC 
TCTGGCAAGA 
GTGGTCTGCC 
CGTGGGTGTG 
TGATGGCAAA 



TGCTCTGGTT 
TCCTCCTCCA 
CTGGTGAACA 
GGGAACGCCA 
TTCCTCAACA 
CTCCACAGAA 
CTGCCTCCAC 
AGGCAGGGAG 
CCAGAACCAT 
AAGTTCCGCA 
GTGGCCAGCC 
TACTCTATCT 
ATGGACCCTG 
GAGGCTGGGG 
GAGATATCTG 
CACAGTGGGG 
AAGGAAGAAG 
AGAGGACGTG 
TTTGGCCCAC 
CGCCAAGCCC 
CGTCTGGCTG 
AAATGTAAGG 
GCGCTCAGCA 
GGGCAGGCCA 
CAGCACCTGC 
GACGGAGATC 
GCCACCATGC 
GGGAGAGAGA 
AGCTGCTTGA 
GGGAACTGGG 
GGGAAGATTC 
TTCCCCCCAG 
CCTGTCCTCC 
CTAGGGGGCC 
TGCGACAATG 
GACGCTGCGG 
TGCAGAGATT 
TGCCCGTGGC 
TTTACAAGGA 
CACCGGAGGA 
GCTCCCTGTG 
ATCCCAGAGA 
GACGACCCCC 
GTGGACATCG 



GGATGGGACT 
GGTACAATGG 
TCGCGGTCGA 
TCGGGGTCAC 
CCAATAATGC 
ACAGGCCTGT 
TCAGCGGAAG 
AGAGGGTGCC 
TTCTTCTGAG 
ATAACCGGTG 
TCTTTGCCGG 
ACATTGCCAA 
AGGCCAGTGA 
TCAGCAAATA 
GCAGAACCGA 
ATGGAAGCAC 
CAGCAGCTTT 
TTCGAACAGC 
CATGTTACTA 
CCCAACACTA 
GAAAACTAGC 
GCCGCCATGC 
CCACTGTGGT 
TGTCCAGATG 
CTGCTAGAGA 
CAGGGAGGAG 
CAGCTCTCGG 
CTGGGGCAGT 
GGCCTCTTGA 
TTCTGGACAT 
ATGGAGACCA 
GCTCCTCTGA 
AGGTGGGCCT 
GAGGCGTCAG 
AGAATGGGCC 
CCAGTGCTGA 

rrccTCACTC 

ATGCACGTCT 
CCGGGTCACG 
CACTCAGCCT 
TCCTGGGGTC 
GCCTGATGAC 
ACCAGCATGG 
TCTATGGCAA 



GGGTGGGCCC 
ACCCAACCTG 
TGAGCGCAGC 
AGCCTGCGAC 
CTTCTCGGGC 
GCTGAAGCCT 
GGACTTTTCC 
GGTTCCCTGC 
ACCCAAATCA 
GGAAGACATC 
ACGCTCTGTG 
TTACGCCTAC 
CCTCTCCCGG 
TACAGAAGGC 
GGAGCGGGAA 
CAGCCAACTG 
GGTGGAGGAA 
TCTGCAGACT 
TTCTGTCTGC 
CCCTGTAGCC 
CCGGAGTGTC 
TGAGCCCGGC 
GCCAGGGGGC 
TGCACTCAGG 
GCTGTATGAC 
AAGGGACTCG 
GGGACTCGAG 
AGGAAGACCA 
AGCCGGGACA 
GGCCAAGGCC 
TGAGCCCAGA 
GGAGCCTCTG 
GGGGCTTGCT 
CGTGGGCCCC 
TAACTTCCTT 
ACGTCGTTTA 
CCTGTGCCAC 
TCTTCAGGCT 
GTTCTATTCA 
CCAGGGTTCT 
TCTGATCCCC 
CCACAGCTAT 
GCGAGGTGTC 
CTGGAATGGC 
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240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1660 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATOGCC 2700 

TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG ,3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3 840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 40B0 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAQ ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 



Seq ID NO: 60 Protein Sequence 
Protein Accession 8: FGENESH 



MACPGGLPAR 
SPYYALRDRQ 
PPTTPAGLLG 
GVATYTDKLF 
GNVGPDALIE 
GGDPEEADBE 
SKSHLADKNL 
PHPRAPGMDP 
ELGGPWSQAT 
GPGRVAKREI 



11 
I 

CSGWMGLGGP 
GNAIGVTACD 
LPPLSGRDFS 
KFRNNRWEDI 
MDPEASDLSR 



SATHCGSMSF 
AFIVHLKYHL 
FLTQGLASSA 
LSSERVNVGV 
SPKFSMPSPV 
GOGEGLRIRR 
KGKGNVAQSV 
RGPITTRKRG 
RKGLRAPITT 
NHYQEKGLQG 
AMGSNHYQEK 
GRGTGGWTD 
KWLYTKKSG 
MNSVtiEILYP 
GSYRCRTNKK 
PGCRLLLKRA 



FGPPCYYSVC 
KCKGRHAEPG 
QHLPARELYD 
GRETGAVGRP 
GKIHGDHEPR 
LGGRGVSVGP 
CRDFPHSLCH 
HRRTLSIjQGS 
DDPHQHGRGV 
RTVITADFDN 
GGFPGPGGQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYGVQSL 
PITTRKRGYR 
GLRAPITTRK 
PDGDGMLDL.I 
AHLRI IDGGS 
RDEDTLQDPA 
CSRGYEPNED 
QLQAAPSTLL 



21 
I 

SGSSPASPPH 
IDGDGREEIY 
SSLGQASPDS 
LSDEVNVARG 
GILALiRDVAA 
CRLGWKDGQF 
APSPAHPFPA 
LMAEALGAWP 
LGEPPILQRT 
LSHPLVPNFP 
FRDRKAREAE 
ILSSSASDIP 
LAETGPSSSC 
QGAPPCLIiAR 
ALADFNRDGK 
DQELEIFFNN 
KVNTGPLMKK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGATGSNH 
VQSLPQKGAT 
RGYGVQSIiPQ 
LSHGESMAQP 
GYLCEMEPVA 
PLECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



31 



SSSRYNGPNL 
FIiNTNNAFSG 
RQGERVPVPC 
VASLFAGRSV 
EAGVSKYTEG 
KEEAAALVEE 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRRDS 
SCLRPLEAGT 
FPPGSSEEPL 
CDNENGPNFL 
CPWHARLLQA 
APCVLGSLIP 
VDIVYGNWNG 
IAYRSSSANR 
QKGRKDEDWA 
PITTRKRGYG 
GLQGPITTRK 
YQEKGLRGPI 
GSNHYQEKGL 
KGATGSNVIR 
LSVFRGNQGF 
HFGLGKDEAS 
QENGHCMDTN 
SRHTMTWKPR 
VYEQDQE 



41 

I 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
QREAGAAGVP 
PLVTQLMTHG 
LRSWEESRQK 
PKVTQECHLV 
VPGAALPGNP 
LQFPSGLRGS 
FHNRGDGTFV 
PHCHHGLSMS 
TAYYIVLWSA 
PHRLYLQMST 
LFRCSILARG 
RGCGNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPITTRKRG 
REHGDPLIEE 
NNNVIIiRWPR 
SVEVTWPDGK 
ECIQFPFVCP 
PKKELQLSQG 



51 
I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFIiLRPKS 
YSIY1ANYAY 
EISGRTEERE 
RGRVRTALQT 
RLAGKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGLA 
OAAASAERRIi 
FTRTGSRFYS 
IPESLMTHSY 
HGKVRFRDIA 
SSSLTAGGRN 
AKEPASAIAG 
GSNHYQEKGIj 
KGATGSNHYH 
SLPGKGATGS 

yglqslpgke 
lnpgdalepe 
trfgafarga 

MVSRNVASGE 
RDKPVCVNTY 
ICTPWISFFL 



Seq ID UOt 61 DNA sequence 

Nucleic Acid Accession ft: NM_0Q05B4.i 

Coding sequencer 75.. 374 

1 11 21 31 « Si 

AGCAGAGCAC ACAAGCTTCT LgACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 



60 

120 
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300 

360 
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60 



250 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CACTGTGTGT 
CTGCAGCTCT 
GCATAAAGAC 
AGAGTGGACC 
TCTGTCTGGA 
CTGAGAATTC 
GAAACTTCAA 
CCAGATGCAA 
CATTGTACCA 
CAAAAAACAA 
TACAAATAGC 
GAATGGGTTT 
TTTGCCATAA 
CCTAGTCTGC 
TTTCTAAGTG 
TGGAAGCACT 
CTATTTATTA 
AGAAGATGAA 
GATATTAAAT 
ACAATTGGGT 
AGTACATTAT 
AGTCTTGTCA 
AACTATTAAA 
TGTTTATTAT 
AAGGCTTTAT 
TTCTGATTGT 
AAGTAAAAAA 



AAACATGACT 
GTGTGAAGGT 
ATACTCCAAA 
ACACTGCGCC 
CCCCAAGGAA 
ATAAAAAAAT 
GCAAATCTAC 
TACAAGATTC 
TGAAATATCC 
CAAATAATTT 
AAAATTGAGC 
GCTAGAATGT 
AGTCAAATTT 
TAGCCAGGAT 
GAAAAAGTAT 
TTAAGTTTTT 
TTTATGTATT 
TCATTGATTG 
GATGTTTTAT 
ACCCAGTTAA 
TGTTTATCTG 
TTGCCAGCTG 
ACAGCCAAAA 
GTACAAATAG 
ATTTTTAACT 
ATGGAAATAT 
AAAAAAAAA 



TCCAAGCTGG 
GCAGTTTTGC 
CCTTTCCACC 
AACACAGAAA 
AACTGGGTGC 
TCATTCTCTG 
TTCAACACTT 
CTGGTTAAAT 
AGAACATACT 
TTAAATATAA 
CAAGGGCCAA 
GATATTTGAA 
AGCTGGAAAT 
CCACAAGTCC 
TAGCCACCAT 
TCATCATAAC 
TATTTAAGCA 
AATAGTTATA 
TAGATAAATT 
ATTTTCATTT 
AAAGTTTTAA 
TGTTGGTAGT 
CTCCACAGTC 
ATTCTTATAA 
TTAAGATGTT 
AAAAGTAAAT 



CCGTGGCTCT 
CAAGGAGTGC 
CCAAATTTAT 
TTATTGTAAA 
AGAGGGTTGT 
TGGTATCCAA 
CATGTATTGT 
TTGAATTTCA 
TATATGTAAA 
GGATTTTCCT 
GAGAATATCC 
GCATCACATA 
CCTGGATTTT 
TTGTTCCACT 
CTTACCTCAC 
ATAAATTATT 
TCAAATATTT 
AAGATGTTAT 
TCAATCAGGG 
CAGATAAACA 
TTGAACTAAC 
GCTGTGTTGA 
AATATTAGTA 
TATTATTTAA 
TTTATGTGCT 
ATGAAACATT 



CTTGGCAGCC 
TAAAGAACTT 
CAAAGAACTG 
GCTTTCTGAT 
GGAGAAGTTT 
GAATCAGTGA 
GTGGGTCTGT 
GTAAACAATG 
GTATTATTTA 
AGATATTGCA 
GAACTTTAAT 
AAAATGATGG 
TTTCTGTTAA 
GTGCCTTGGT 
AGTGATGTTG 
TTCAAGTGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCCTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 
CTCCAAATTT 
TAAAATATAA 



TTCCTGATTT 
AGATGTCAGT 
AGAGTGATTG 
GGAAGAGAGC 
TTGAAGAGGG 
AGATGCCAGT 
TGTAGGGTTG 
AATAGTTTTT 
TTTGAATCTA 
CGGGAGAATA 
TTCAGGAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 
GGTTGAAACT 
TTTTAAATAC 
TTTTTACTGT 
TTTGTTGTCA 
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Seq ID NO: 62 Protein Sequence 
Protein Accession #: NP_000575.1 



11 



21 



31 
I 



41 



51 



MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 
CANTEIIVKL SDGRELCLDP KENWVQRWE KFLKRAENS 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 52-1146 



GTTACCCAGC 
CAAGTGGTCA 
GTTGGTGGAG 
TCCATGTGCT 
ACCAATGGAA 
TCAAGAAGGG 
TGTTGCCGTG 
ATTGTGCCAC 
GTTCGAACGA 
ACAACTGTTC 
GTTTCAACGA 
GTGACAACAA 
CCAGTAGCCA 
GGAGCAATAA 
GACACCGTGA 
GAACATAGTC 
GTCTTGGTGC 
GAGGTTCAAC 
GTTGAAAAGG 
GACTAAGACC 
ACATCAGCAC 
GGCATTCCAA 
GTCAACCTCA 
ATCATTGCAG 
ATAAATCCTG 
TTGGCCATCA 
GAGACTTCAT 
TTGCATTTCC 
AACAAAGTGA 
ATTAAGTACT 
TATGGAACAC 
GCAATTTTTT 
ATGGTGAAAT 
CAGCCTCACG 
TTTTTTTAGT 
CAGAGGATCC 



11 
I 

ATTGTGAGTG 
TCTTAAGCCT 
AGGCAGGTCC 
GGAATAGAGG 
CCCACGTCAC 
ATGTCTCTTT 
TTGAGCACCG 
CCAAGGTCAC 
GCACCACTGT 
CAACAACAAT 
CAACGAGCGT 
CTGTCTCTAC 
CTTCACCATC 
GGAGAGAACC 
CAGAGTCTTC 
TACTGACGGC 
TTCTTGCTCT 
AACTAAGTGT 
AAGTCCAAGC 
CAGTGGTGCT 
ATCAGACGTC 
CATGTCAGTG 
TCATTAATGT 
TTTTCTCTCA 
TATACATCAA 
CAGCTGTCCA 
TTGGAAGCAT 
TATGTTTTCC 
GTCTAACTCA 
GGATCTCTGA 
ATAGACACCA 
T TTTT T T T T T 
CTCGGCTCAC 
AGTAGCTGGG 
AGAGACAGGG 
AGCCTCCTTG 



21 



ACAGAGCCTG 
CATCCTACAT 
ATCTGTCACA 
CTCATGTTCT 
CTATCGGAAG 
GACCATAGAA 
TGGGTGGTTC 
GACTACTCCA 
TCCAACGACA 
GAGCATTCCA 
TCCAACGACA 
CTTTGTTCCT 
TTCACCTCAG 
CACCAGCTCA 
AGATGGCCTT 
CAATACCACT 
TTTGGGTGTC 
TTCATTTAGC 
AGAAGACAAT 
CTTTGAGAGT 
TTTTAGACCC 
ATACTGGGTA 
AGTCCTAATT 
AATATGAACA 
AGAGAATAGT 
GAAGAGGGGA 
TGTATGATTA 
AAAGGTTTCA 
CAGGTTTAGC 
ATTGGGGTAG 
GGGGAAGAAA 
TGAGATGGAG 
TGTAACCTCC 
ATTACAGGTG 
TTTCATCATG 
GCCTTCCAAA 



31 

I 



41 



GATCTGAACG 
CTGGCAGATT 
CTACCCTGCC 
CTATTCACAT 
GACACACGCT 
AATACAGCTG 
AATGACATGA 
ATTGTCACAA 
ACGACTGTTC 
ACGACAACGA 
ACGAGCATTC 
CCAATGCCTT 
CCAGCAGAAA 
CCATTGTACT 
TGGAATAACA 
AAAGGAATCT 
ATCATTGCCA 
AGCCTTCAAA 
ATCTACATTG 
TTACGCCCAT 
CAAGACAATT 
GAGTAACTCT 
TTTTATGCTA 
CTTTATAATT 
CCCTGGAAAC 
ATCTGTCTTA 
TCTCTTGTTT 
AATCGTGGGT 
TGTTTTCTCA 
CTGTTTTACC 
ATCATTTGCC 
CTTTGCTCTT 
ACCTTCCGGG 
TCCGCCACTA 
TTGGCCAGAA 
GTGCTGGGAT 



CTGATCCCAT 
CTGTAGCTGG 
ACTACAGTGG 
GCCAAAATGG 
ATAAGCTATT 
TGTCTGACAG 
AAATCACCGT 
CTGTTCCAAC 
CAATGACGAC 
CTGTTCTGAC 
CAACAACAAC 
TGCCCAGGCA 
CCCACCCTAC 
CTTACACAAC 
ATCAAACTCA 
ATGCTGGAGT 
AAAAGTATTT 
TTAAAGCTTT 
AGAATAGTCT 
GAGTGCAGAA 
TTTCTGTTTC 
CTCACTCCAA 
AAACTGGCTC 
GTATGTTCTT 
ATAGCAAATG 
AAAACCAGCA 
CTATGTTATA 
TTTTATTTCC 
TAACTCTGGA 
AGTTAAAGAG 
AGGTGATTTA 
GTTGCCCAGG 
TTCAAGCAAT 
TGCTCAGCTA 
TGATCTCGAA 
TACAGGCATG 



60 



51 
I 

AATGCATCCT 
TTCTGTAAAG 
AGCTGTCACA 
CATTGTCTGG 
GGGGGACCTT 
TGGCGTATAT 
ATCATTGGAG 
CGTCACGACT 
TGTTCCAACG 
GACAATGACT 
AAGTGTTCCA 
GAACCATGAA 
GACACTGCAG 
AGATGGGAAT 
ACTGTTCCTA 
CTGTATTTCT 
CTTCAAAAAG 
GCAAAATGCA 
TTATGCCACG 
GACTGAACAG 
AGTTTCATCT 
ACTGTGTATA 
AATCCTTCTG 
TTTAGACCCC 
AACTTCTATC 
AATCCAACGT 
CTTCCAAATG 
TCCGTGGGGA 
AATGTGATGC 
CCTACAATAG 
ACATATTTAT 
CTGGAGTGCG 
TCTCCCGCCT 
ATTTTTTGTA 
CTCCTGACCT 
AGCCACCGTG 
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251 
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CCTGGTCCAT TTATGCAATT TTTATTACAC 
CATCAGGCTT TCTGAACAAA TTTTTAGAAC 
AGGCGGGAGT AGGGAAAAGT CTGCCTTTTC 
GTTTGAAACT CACTAGTTTT GACCCAAATT 
5 AAAATTGTTT AGGAGGCTAG CGCCGTGGCT 
CAAGGTGGGC AGGTAGCTTG AGCCCAGAAG 
CCCCATCTCT ATTAAGAAAA AAATTTAAGG 
AGGCAAAGTG TCTTTCTCAT ATCTAATCAA 
AATTCAATTA GGCTTATTGT AAGGATATCT 

10 TTATACTAAA CCAGTATAAT TGTTTAGAAG 
ATGCCTGTAA TCCCAGGGCT TTGGAAGGCT 
TTGAGACCAG CCTGACCAAC ATGGAGAAAC 
GGCTAGATTT GTCAGAAAAG AAACATTTCT 
CAATTTCAGT TTGAGTAAAC TCTGAAATAA 

15 CATAGTTGTC TGGGGCAACA GGATGCTTGC 

GTTTTTTTGA GATGGAGTTT TGCTCTTGTC 
GGCTCACTGC AACCTCCACC TCCCAGGTTC 
AGCTGGGATT ACAGATGCCC GCCACAACGC 
ACCATATTGG CCAGGCTGGC CTCCAACTCC 

20 CCCCAAAGTG TTGGGATTAC AGGCATAAGC 
AACGCAGTTA TCATTCTAGG AAATTTATAG 
AAGATTGTGA TACATGGTAT TTCAGTTTTC 
ATTTTATTTT TTAAAAGCTA TAGTGTCTTT 
ATTGGAGATT TGAAATTAAA GCCAAGATTT 

25 GGACTGGATT GCTATGTGAC CAGGGTCCCA 
ATCCTTATGT TCCCTGCATG CTCACCTTTG 
CTTTATGGCA ACTGCCCTAT TCTCTGGATC 
CAGGAATCTG GTTAGCTTTG CTTAACACAT 
GTCACTAGTA GCATTCTGTG CAGTACAATG 

30 CAGGATTAGG GATAGGACTT GAGTCTAATC 
TAGAGCAAAT GGTTCTTTAC GTCTACATTT 
TCAACTATGA TGTACAGTGC TAATTCCAAT 
GTAAAAAACA CTACAGATAC TGAAGCAGTT 
GCTTGGTAAA TAATTTTTTT TTTCTTTTGA 

35 AGTGTAGTG 



GTGTGTCATG GTGCTAAGCA TTTAAACAAC 2220 

CAAGAAGATC AGCAATAAAA GAAAAAAAAA 2280 

TGCCTTATAA GTTACATTGC AGATATAAGT 2340 

CTAAATCGCC TCCAAACCAC TTTCATGTTT 2400 

CATGCCTGTA ATCCCAGCAC TTTGTGAAGC 2460 

TTTGAGACCA GCCTGGGCAA CATGGCAATA 2S20 

AAAAAAAAAA AAGATAAAAT TGTTTTGGAA 2580 

TTCCTGTATT TTTTTTTT TT TTTTTTTTAC 2640 

TTAGTCAGTA TAAAAATATA TAAAAAACAA 2700 

TATATATTTG COGGCCGGGC ATGGTGGCTC 2760 

GAGGTGAGTG GATCACTAGA GGTCGGGAGT 2820 

ACTGTTCTCT ACTAAGTATA GATTTGCAAG 2880 

ACATGAACAG TTAAGGCTGG GTATTAATTT 2940 

TTCATGGTTG AGGTTCAGAG CTACCCATAC 3000 

TTTCAGAGGA AACTTTTTGT TTTTGTTTTT 3060 

ACCCAGGCTG AAATGCAATG GCACAATCTT 3120 

AAGTGATTCT CCTGCCTCAG CCTCCTGAGT 31B0 

CCAGCTTATT TTTAGTAGAG ACAGGGTTTC 3240 

TGACCTCAGG TGAGCCACCC GCTTTGGCCT 3300 

CACCGCGCCC AGCCAGAGGC AACATTTTTT 3360 

GTCCTTTGAA GGAAAATTCT GTGGGCAAAT 3420 

CCAAATGTGG CCAGCCCGAT CTGGTCAAAA 3480 

TTTTCTTAAA TTTGAGGCAA CATGCACAAA 3540 

GTAGTTTCTC TGGAAAGACC TGGCAAGATT 3600 

CTAGATGGGG CTGCATCCTC TAATCCCCAA 3660 

TTACCTGCCT GACACCTGTG GGGCTTTTAA 3720 

CTTCCTGAGG ATTTATGATG CGTAATACTC 3780 

TTCCAAAACT TGTTTGAATG CATGAGTACA 3840 

TATGGGGGCT TAGGAGTTTA 1 GGGTAGTATA 3900 

CTAACTCTTA GCAGTTACAC TGGATGACAT 3960 

TCTTCATCTG TAGATGTAAT AATTTCCATA 4020 

GAAATGTTAC ATGTGAGAAG TCTTTGAAAT 4080 

TGGAGAATTA AAAAACACTA CGAAAACACA 4140 

GACAGAGTCT GGCTCTGTCA TCCAGGCTAG 4200 



Seq ID NO: 64 Protein sequence; 
Protein Accession #: NPJ)36338.1 

40 i ii 2i 

I 1 I 

KHPQWILSIi ILHLADSVAG SVKVGGEAGP 
IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL 
SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV 
45 TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM 

IRREPTSSPL YSYTTDGNDT VTESSDGLWN 
VLLALLGVH AKKYFFKKEV QQLSVSFSSL 



31 41 SI 

SVTLPCHYSG AVTSMCWNRG SCSLFTOQNG 60 

TIENTAVSDS GVYCCRVEHR GWFNDMXITV 120 

PTTTTVPTTT VPTTMSIPTT TTVPTTMTVS 1B0 

PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 240 

NNQTQLFLEH SLLTANTTKG IYAGVCISVL 300 
QIKALQNAVE KEVQAEDNIY IENSLYATD 



ATCACGGTGA GTGGGGTAGG GGTGACCAGT GTCCTTCAGA GAAGGGGGGA TGAGAAAGCT 

GC^S^a" CACcicCITC ccccaaggct ttccattcgc agaccctccc ctttatggcc 

70 SUSSES GCACCTTCAG GATGGACCTG GGCATCATCT TGGACCAGCC AGGTATGGAA 
TCGTCCCCTT ATTGAGACTC TGCACGGACA AGGGCCCTAG AGATTGACCC TGCAGTGACT 
cScA^GAC CCCTATACAC TCACTTCGGA GAGGGCCATC TCTGGCGGAG GC^AACTCT 
^CACTTCC GCCCCTCCCT GCTGAGCCAG AGAAGCCCTG GCCATTGTCC GTCACTCCGA 
T^C^cIcG GCCACCCTGT GCGTCCCGCC GGTCGCCCCT TACCCCTGGC TCGCCCCTTC 
75 GCCCTTAGAT GGCCAGTTCT ACCAAAGATG GGTTCCGCTG CATGATCCCC GAGACACCOG 
^SSScC AAGGGTTTCA TTAAGGTCAC CTTGTCCGTG AGGGCGCGCG GGGACCTGCC 

SSSSS ctaScccgg ccccagggca ctgttcggac atcgagaagt gagccggggt 

GAGGTCGGGA GGAGGACATG GATCCGGGGG TGGCCGTGGG GCGCGGATAA GGGGAGGGGC 

252 



60 
120 



Seq ID NO: 65 DNA sequence 
50 Nucleic Acid Accession EOS sequence 
Coding sequence: 2932-4764 

1X 21 31 41 51 

55 AGAAGAAAGC IcGTAGGGGC TGGGAGAGGG TACCACAGGG GAGTATGATC TACTTGGGGG 
C^A^GG TTCCCTGAGG AAATAGTACC TGAACTTAGA CTTGAAGGAT AACAGATOrT 
AACTGGGAGG AGAGAATGTT CCAGGCAGAG GAAAAGGCAT ATGCAAAAGT CCAGCGCCTT 180 
« AGCTGGGGTO CCTGGAGTGA GATGGAGCTG GAAAGATCCA GGTG^AGCTT 240 
GACCTGAAGT ACCAGCCCCC AGAGGGCGCT ACTGGAGCCT GGTCAGAGGA GQACTTTGGG 

60 G^Sc^CC AGGACAGCTT CGAGTTAATC ATCCCCAATG TGGGCTTCCA GGAACTGGAG 
ccSIg? CCCAGCTGGA GCGGCGGGCA GTGGCTCTAG GCCGCAGGCT agctcgaagt 
c^ggccagc aggacgatga agagaatgag ctggagcttg agctggagca ggacctggat 
St^otg acgtggaact ttctggtgtt atgttcagcc ccctcaagag ccocgccagg 

GC^Sc ATGGGGATCC CTTCCAGGTG TCCAGAGCTC AAGACTTCCA GGTGGGAGTC 
65 aSg^ctgg aagcccagaa actggtggga GTCAACATTA ACCCCTATGT GGCCGTGCAA 
S^C AGCGCCGTGT GACCGCCACA CAGCGTGGGA CCAGTTOCCC CTTCTACAAT 
GAGTACTTCT TGTTCGAATT TCATGACACG CGGCTTCGTC TCCAAGACTT GCTGCTGGAG 
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CGAGATCCCA 
GCGCGGGGTQ 
GGGGCTTCCC 
GCGCGTCCTG 
GGTGAGCGCC 
CCCGCCGCTG 
GGCACTCGCT 
GTTTAACCCT 
GCTCCGGGAT 
CCGCCTTCTG 
CCAGGCCCAG 
AAGGGATCAG 
GCCTGAGATC 
TGTCCTGGCG 
CGACCCCACC 
CCGAACCCCT 
GCGCAGGCCG 
GTGCAGCCGT 
AAGAACTGGG 
GCTTGCCCCT 
CCTGGCGCCT 
TGCAGGAGGT 
AGGCACTGGA 
CGATGACCCG 
ACCTTTTGGC 
AGAAGGTGGC 
CCGGGGCAGC 
GCAAGCAAGC 
CCGGGCTGCC 
GATGAACTCC 
GAGCCTCCAT 
CTGGGCAGGG 
GAGTTGCAGT 
TTTCAACTCA 
GAGCCCCAGG 
AACGTGTGCC 
CTTGGTCGTG 
CAAGGTGTGG 
GTCAGGCATC 
CCGGAGCAGC 
GGCCACACAG 
GGTCATGAGG 
AAGGGACCTC 
CAGCTCCTGA 
GTCAACCTCA 
ACTGTGTCCC 
GAATTTAACC 
CAGGATGGAG 
TTCCTCATTT 
ATCCCACAGA 
CTGGCTCCTG 
GAGCGGCAGG 
ATCCTGGAGC 
CATGACCTCG 
TTCTATAGCC 
CAGGGCCCAC 
TTGTGTCTCC 
ACCAAGACAT 
TCACTTGACC 
AGTGGCACGA 
TCAGCCTCCC 
TTTTCGCTCT 
ACCTCCTGGG 
TGGGCCCCCA 
TCCAGGCTGG 
CTGGGATTAC 
TCATCTGCAA 
TTTTTTTCAT 
ACCTAGACCT 
CAGCCCCAGC 
ATGCATTCTG 
CTGAATACCG 
CCCTGCCCCC 
CCACCCCCAG 
TGGAACAOGG 
GACATCTATG 
CACTTCAACT 
TACCTGCCCA 



GTTTCTCCCC CCCCGCTCGG 
CCCGCCGAGA GGCCATGGGC 
GCGCTGCGCC TGGGGCTGCT 
GTGGAGCCCT ATGTGCGGGT 
GAGGCGGCGG CGCCCGAATG 
ACGCGCAGCC TCCGCCTGCA 
ACGCACGTGC CGGACCTGAG 
ACCTTCGGCC OGGCCTGGGT 
AGTCTTCAAG GTCTCAACGA 
CTGGCTGTGT CCATGCAGGT 
CAGGGGTCCA CGTTGTCCCG 
ACCCCAAAGG CGGTTCCGCA 
CCCCGTGCCA TGGAGGTGGA 
CCCTGTGAAG ATTTCCTGCT 
GTGGCCTCCC AGCCCATCAG 
GAGTGCCATT TCAGACCTTA 
TCTGGAGGAG CAATTGGGCC 
GGAGGCTCAG CCTCTGCTGG 
GACCCATGCT CAGCGGCCTG 
CTGTCACTGC AAGCCATGCA 
GCAGAGCAGC AACTGCGTGC 
TGAGAGACTG CAGCGCAAGC 
AGTACTGGTG GCTGGGAGCA 
GCCCAATGCC CTGGATCGAT 
TAAGCAAGGA CTGCGACTTC 
ACTGGCCAAG AAGCTCCTGG 
CCCTGGTGAG GTCTGTGCCA 
CAAGGCCTGC ACCTCTGAGC 
CTCCAGCCTA CACCGGGACG 
TGGTATTTGA GCAGTTGATC 
TAGTGATCAT CAATGTATTT 
CACTGGCCGC CCCAAGGGTA 
TCTTCCCCCT GAGGAAGGGA 
TTGAACTAGA CTACAGTGGC 
ATCTGGCACC CCTGGTTGAG 
CAGTGCTCAG GGAGTTCCGT 
TGCATCTGCT CGAGGTGGAG 
AGTCTGAGGT CCTGGCCAGC 
TGACAGTGGT CTTCAAAGAC 
CTTACTTGCA GCCTCCACTC 
TCCTTGTGGG TTCCCACATT 
ATCCTCCTGA GGAGGAAGGA 
AAGGACAGAA GTCCCTGGAT 
AGCACAACTT TGATGAAGAT 
TTTCTATGGT TGGGGAGATC 
CAAAAAAAGC AGTTGCCACC 
ACTTTGAAGA CTGGCTGAAT 
GTGGAGAAGA GGAAGGATCT 
ACCCTGAATC AGAGGCAGTG 
ACCGGCCCAT CAAGCTCCTG 
CAGACCCCAA TGGCAAAGCA 
ACACCAAGGA ACGCTACATC 
TAAGCATCTC TCTCCCAGCT 
TGGGTTCTGA CGACCTCATC 
ACCACAGAGC AAACTGTGGG 
AGGAGCCATT CTGAGTTTCT 
AGATCCTATG ATTTCATGGA 
AGCTCAGTGG TCAAGACTTG 
GCTTTCTTTT tttatgaaac 
TCTCGGCTGA CTGCAACCTC 
CAGTAGCTGG GATTACAGGC 
GTCTCCCAGG TTGGAGTGCA 
TTCAAGCGAT TCTCCTGCCT 
TGTCCAGCTA ATTTTTATAT 
TCTTGAACCC CTGACCTCAA 
AGGTGTGAGC CACCATGCCA 
AACAGAGGTA ATAAGATCAG 
TCATTGTTAT CATAAATGAG 
CATGTCTTCA TGACGTCATC 
TGACGCATCT TCTTTTTCTG 
GCCTTOGCAG ATGCTGGCGG 
AGCCGGTGCT GTCAAGGTGG 
AGGGATCTCT TCACATGTGG 
TTGACATCAA GCCTCGGCAG 
AGGATGTGGT TCTGGATGAC 
TGAAGAGCTG GGTGAAGGGG 
CCCTGACTGG GGAGGGGAAC 
CGGAGCGGGA GGTGAGCGTC 



TGCCCCCTCC 
GCGGCTCCGC 
GGGCAGCCTG 
GTCTTTCCTG 
GAACGAGCAG 
GCTGCGGGAC 
GCGGATCTCC 
GCCCCTCTAT 
AGGCGTTGGC 
GTTGGAAGGG 
GCTCACCCGA 
GCACTTGGAC 
GGTGGAGGAG 
TTTCGGTGTG 
CTTCGAGATC 
GAACCCTGGA 
GAGGGTCCAG 
GAGCCAGGCC 
AGCCCATGGA 
TGCATGTGTG 
GCAAAGTGGC 
CGGGGCCTGG 
GACAGTTTTG 
GCCGAGGGAA 
TACGCGGCCT 
CAAAACTGCG 
AGCTGGAGCT 
TGCCCCCGGA 
GTCCTGGAGC 
GTGGATGGGA 
GACCACAATA 
AAGCTGATGG 
CCCTGGGCAG 
CGACTTGAGC 
CCCCACTCTG 
GTTGAGGTGC 
CAGCCCCAGG 
TACCGTGAGA 
ACAGCTCCTC 
AGCATCTTGG 
GTCCCCCACA 
GAGATGGAGG 
CCCTTCTTGG 
GAAATGGATG 
CAAGACCAGG 
CTGAAGATCT 
GTGTTTCCTC 
GGACACCTTG 
TTGTTCTCTG 
GTCAGAGTGT 
GACCCTTACG 
CCCAAGCAGC 
GAGACGGAGC 
GGGGAGACCC 
CTGGCCTCCC 
GGCCAAACAC 
AGGGGACCCT 
GGCTTGGGAG 
AGTCTCGCTC 
CACCTCCTGG 
GTGGGCCCCC 
GTGGCACGAT 
CAGCCTCCCC 
TTTTAGTAGA 
GTGATCCACC 
GGCCCTCTTA 
TATCTTCTTA 
GACTAACCTG 
ACTGCCCCAG 
CCTGTAGAGA 
GGCTGTGCCA 
GCAGCAAAGT 
ATTGACATCT 
CCAATCAGCT 
GAGAATCCAC 
TTGGAGCATG 
TTCAATTGGC 
TGGCGCAGGT 



CCTAGGAACC TGCTCCTGCC 
GTGCGCCTGT ACCGCGCCGA 
GTCCGCGCCC TGCACGACCA 
GGGCAGGAGG GCGAGACGTC 
CTGAGCTTCG TGGAGCTCTT 
GACGCGCCCC TGGTCGACGC 
CATCCGGGCC GCGCGGCGGG 
GGCTCGCCCC CCGGCGCGGG 
CAAGGCATTT GGTTCCGCGG 
AGAGCTGAAC CTGAGCCTCC 
AAGAAGAAAA AGAAAGCCAG 
GCCAGCCCCG GTGCCGAGGG 
CTGCTGCCGC TGCCAGAGAA 
CTCTTCGAGG CCACCATGAT 
TCCATTGGTG TGTGGCCTAG 
AGGGGTGTTG ACTTTCAGTC 
GGCTGGGGAG GGAACTGAGG 
AGAGGAGGAG AAAGAGGAGG 
CGGCAGTGGG CCATACTTCT 
GAGTTGCTGG GAGGACCACA 
CGAGAGGCTG GACCAGGGGC 
CGCCTGTGCA CAGCTCAAGC 
CCACGGTGCC GAGCGCAGGA 
ACTCCTGGTG CACAGCCTGA 
GAGACGGCGC AATGTGCAAA 
CTTTCTGGCT GAGGAGGCAC 
CTTCCTGCGG CTGGGCCTGG 
TTTGCTGCCC GAGCCCTCAG 
AGACGCTGAG CCCTCTGTGG 
GGAGGGAGCA CCTGCAGGAG 
AGTTTGGCCC CCCCGTGTTC 
AGGACCCATA CCAACGCCCA 
CCGGAGAGCT CATTGCCGCC 
CCTCAGTGCC CAGTGAGGTG 
GACGCCTGTC CCTTCCACCC 
TGTTCTGGGG TCTTAGGGGA 
TTGTACTGGA GGTGGCTGGG 
GCCCCAATTT CACTGAGCTT 
TCTTCCACCC CCAGGACTTG 
TGATTGAGCG CCGGGCCTTT 
TGCTGCGATT CACATTTCGG 
AGACAGGGGA TATGATGCCC 
CTGAAGCGGG TATATCCAGA 
ATCCTGGAGA TTCAGATGGG 
GTGAGGCTGA AGTCAAAGGC 
ACAACAGGTC CCTGAAGGAA 
TGTACCGAGG GCAAGGGGGC 
TGGGCAAGTT CAAGGGCTCC 
AGCCCCAGAT CTCCCGGGGG 
ATGTTGTAAA GGCTACCAAC 
TGGTGGTGAG CGCTGGCCGG 
TCAACCCCAT CTTTGGAGAG 
TGACGGTCGC CGTATTTGAT 
ACATTGATCT GGAAAACCGA 
AGTATGAAGT GTGGGTCCAG 
ATTCAAGCTC ACATTCCCTT 
CCCACCCACC GCCACTGCCA 
TCGGGATCCT GTAACGAATG 
TGTCTCCCAG GTTGGAGTGC 
GTTCAAGCGA TTCTCCTGCC 
ATGTCCAGCT AATTTTTATA 
CTCGGCTGAC TGCAACCTCC 
AGTAGCTGGG ATTACAGGCG 
GACAGGGTTT CACCATGTTG 
CACCTCTGCC TCCCAAAGTG 
ACCTCTTCAA GTCTGTTTTC 
ATGGAAGCAC CTGGACTACA 
TCTCCCGTTG GGAGTTTTGA 
GCCCAGCTGT GTCCCTACAC 
TGGTTACAAT GCCTGGCGTG 
ACGCTGTGGC CTCCCTGCCC 
CTTCCTGACA CCACCGGAGA 
TTCCTCAAGA TGTGCCTGCT 
ATGAGCTCAG AGTTGTCATC 
TCACCGGAGA gatgtcgagt 
ACAAGCAGGA GACAGAOGTT 
GCTTTGTGTT CCGCTTTGAC 
CTGGACCCTT TGCCCTGGAG 
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GAGGCGGAGT 
CAGACATGGT 
GGGCCGGGCC 
TGAAGCTGAA 
AGAAGCGAAA 
GTGGCAATGT 
AGGAGGCCGA 
CCAGCCGCCC 
TCATCTGGCG 
TCCTCCTCCT 
TCCACAAGTG 
GCTCCTGGAA 
AAACCTATCA 



TCCGGCAGCC 
GCGTGGGGCC 
GAGGTGCAAT 
GGAGGCAGAG 
GCAGAGGAGG 
GTACATCCTC 
GAAAOGGCCA 
CAAAACTTCC 
CCGGTACTGG 
GGTCTTCTAC 
ACTCTCGCTG 
GTCTTTCTTA 
CAGCC 



TGCAGTGCTG 
CGGGGCCCCG 
CTGTTTCGCT 
GACGTGGAGC 
AGGAAGGGCC 
ACGGGCAAGG 
GTGGGGAAGG 
TTCAACTGGT 
OGCACCCTGG 
ACCATCCCTG 
ACCTTGGACA 
CCCATGTGAG 



GTCCTGCAGG 
AGCTCTGCTC 
GCCGCCGCCT 
GGGAGGOGCA 
GGCCAGAAGA 
TGGAGGCAGA 
GGCGGAAGCA 
TTGTGAACCC 
TGCTGCTGCT 
GCCAGATCAG 
CTCACCCAGG 
CTACCCCAGA 



ATCCCTGGAG 
TGTGCAGCTG 
GAGGGGCTGG 
GGAGGCTCAG 
CCTGGAGTTC 
GTTTGAGCTQ 
GCCAGAGCCT 
GCTGAAGACC 
ACTGGTGCTG 
CCAGGTCATC 
GTGCCAACCC 
GTCTAGTGCT 



TTGCAGCTAC 
GCCCGCAATG 
TGGCCGGTAG 
GCTGGCAAGA 
ACAGACATGG 
CTGACTGTGG 
CTGGAGAAAC 

CTCACCGTCT 
TTCCGTCCCC 
TTCAATGCCT 
TCCTCTGAAT 



Seq ID NO: 66 Protein 
Protein Accession Bj 
1 11 

I I 

MCKRRWHWPR SSWQNCAFWL 
SPQPGCPPAY TGTVLEQTLS 
PVFLGRALAA PRVKLMEDPY 
SEVEPQDliAP LVEPHSGRLS 
VAGQGVE9EV LASYRESPNF 
RAFGHTVLVG SHIVPHMLRF 
ISRQLLKHNF DEDEMDDPGD 
LKEEFNHFED WLNVFFLYRG 
SRGIPQNRPI KLLVRVYWK 
FGEILELSIS LPAETELTVA 
WVQQGPQEPF 



sequence t 
EOS sequence 
21 31 

I I 
RRHPGQPLVR SVPSWSSSCG 
PLWDELLVFE QLIVDGRREH 
QRPELQFFPL RKGPWAAGEL 
LPPNVCPVLR EFRVEVLFWG 
TELVRHLTW FKDTAPLFHP 
TFRGHEDPPE EEGEMEETGD 
SDGVNLISMV GEIQDQGEAE 
QGGQDGGGBE EGSGHLVGKF 
ATNLAPADPN GKADPYVWS 
VFDHDLVGSD DLIGETHIDL 



41 
I 

WAHASKPRPA 
LQEEPPLVII 
IAAFQLIELD 
LRGLGRVHLL 
QDLPEQPYLQ 
MMPKGPQGQK 
VKGTVSPKKA 
KGSFIiIYPES 
AGRERQDTKE 
ENRFYSHHRA 



51 
I 

PLSCPRICCP 
NVFDHNKFGP 
YSGRLEPSVP 
EVEQPQWLE 
PPLSILVIER 
SLDPFLAEAG 
VATLKIYNRS 
EAVLFSEPQI 
RYIPKQLNPI 
NCGLASQYEV 



6180 
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60 
120 
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Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession ft: NMJ)02449.2 

Coding sequence: 223.. 1026 



GGGGGGGGGG 
GAGGCCCGGC 
TCCCAGCGCG 
CAGCGGAGTC 
GGCAATGACT 
GGGCTGGGGG 
TTCAGCGTGG 
CCCGAAGGCG 
CGGGAAGCGC 
GGAAATTCAG 
AGACATATGA 
ACGCCCTTTA 
CTCTCCATTG 
AAAATCTGGT 
AAGCTGAAAA 
AGCTCGCCCC 
CTTCCCATCC 
TCCTAAGGAA 
TCCCTCrCCA 
ACCCCACCCC 
GTAGGCGGAG 
CCCTTGCACA 
AAAAATACTA 
AAATTTATAT 
AAGTCCATTT 
GTCTCCAGCC 
GTCAATGATT 
GCCCGCTGTG 
ACAATGCAGC 
CCACGGOGTC 
TATTTCTGCA 
GGTCAAGTCG 
TCCCGCCCAC 
GCCAAGACAT 
CCAATACCTG 
GCGCACGCCC 
ATACTTCCTA 
AGTACCTCTC 
GTTAAAACCT 
TCTGAGAAAT 
TAGTAATTTT 
TGAACAATTA 



11 
I 

GGCAGCCTCT 
GCGGATTGGC 
CCCCTCCCGT 
GCGCGTCGGG 
TGTTTTCGCC 
GCGCCGCGGG 
AGGCGCTCAT 
CCTCGGCCGG 
ACAGCCCCGG 
AAGATGGAGC 
GCCCTACCAC 
CCACATCCCA 
CAGAGCGTGC 
TCCAGAACCG 
TGGCTGCAAA 
TGCAGGCAGC 
CGCCCGTGGG 
GACCAGATCA 
CAAAGGCATA 
CTCTAACGGC 
GCACCAAGCC 
AAAGCTTGGC 
CAGACCCTTT 
ACATATGTGC 
GTTGTATTTT 
TGCCCTTCAG 
TTTACCGGGA 
CCTCCCGAAG 
AGCCAGAAAA 
CGGGAAGCGC 
TTTTACTTGC 
GGAAATTCAG 
GCCCACACAT 
ATGAGCCCTA 
GGATGAGAAG 
TTTACCACAT 
ATTCTTCTCA 
CATTGCAGAG 
GTGAGAGGAT 
ACTAGGTACA 
CTGCTTAAAA 
GTAGATCCAG 



21 



31 



CGGGAAGAGC 
GGCGCGCGTC 
CTCCGCAGCA 
AGCTACGTAG 
CGACGAGGAG 
GGCCGCGGAG 
GTCCGACAAG 
GGCCCACCTG 
GCCGCTGGTG 
GGCGTGGATG 
CTGCACCCTG 
GCTCCTCGCC 
AGAGTTCTCC 
AAGCGCCAAG 
ACCTATGCTA 
GTCCATATAC 
ACTCTATGCC 
ATAGACTCCA 
GCCAGCCAGT 
TAGCTGACAG 
CTGCTTTTCT 
TCTGATGGTT 
TTATCAGCAG 
TTTTTTTCTA 
CTTAAAGAGG 
CGTGGAGGCG 
TTATTGACTT 
GCGCCTCGGC 
TATTAGATAT 
ACAGCCCCGG 
ATTAAAAGAA 
AAGATGGAGC 
ATTTTTAAAG 
CCACCTGCAC 
AATCOGTAGA 
CCCAGCTCCT 
AGGCTGTTGG 
CGTGCAGAGT 
TGTACAGCTC 
TTCATCCTCA 
AATGAGTATC 
AAAGAAAAAA 



CAATCAGGGG 
TCCCACTTCC 
AAAAAGTTTG 
GGCAGAGAAG 
GGCCCAGCAG 
GAGCGCCGCG 
AAGCCGCCCA 
CGGCCACTGC 
AAGCCCTTCG 
CAGGAACCCG 
AGGAAACACA 
CTGGAGCGCA 
AGCTCTCTGA 
GCGAAAAGAC 
CCCTCCAGCT 
GCAGCATCCT 
ACGCCAGTGG 
TGATGGATGC 
ACTCCTGCGC 
GGCCACACCA 
TGGTGTAACT 
TTTTTGGCAT 
ACGTAAAAAT 
TATCTCACCT 
GAGACAAATT 
CTCATGTCCG 
CTGCTTATAC 
CGGCCTGCGG 
GGAGAGATTA 
GCCGCTGGTG 
ACCTCTTTAT 
GGCGTGGATG 
TTTTTAGGAA 
CCTGACCTTT 
CTGCCGGAAA 
CGCCCTGGAG 
TAACTTTGGA 
TCTCTATTTC 
TCTGAACCTC 
CAGATTGCAA 
TTGTAACCAT 
AAAATATGCT 



41 
I 

CGAGCGTCTT 
CCTCGGAGGA 
AGTCGCCGCT 
TCATGGCTTC 
TGGTGGCCGG 
TCAAGGTCTC 
AGGAGTCGCC 
TGCTGTCGGG 
AGACCGCCTC 
GCCGATATTC 
AGACCAATCG 
AGTTCCGTCA 
ACCTCACAGA 
TGCAGGAGGC 
TCAGTCTCCC 
ACCCGTTCCA 
GATATGGCAT 
TTGTTTCAAA 
TGCTAAGCCC 
CATAGCTGAA 
TCCAGAGTCC 
GATGTATATA 
TCAAATTATT 
TCCCAAAAAG 
CGGAGGAGCG 
ATTTGCAAAA 
AAGAAGCCGC 
AAAAACAAAA 
TGGCCACTGC 
ATCAAAGTGA 
AAGCCCTTCG 
CTACATACGT 
CCCGGCCGAT 
TTAAGAATAT 
CACAAGACCA 
GTGAGGTAGA 
GCGCAAGTTC 
AGATAATTGG 
ACAGACCCAG 
AGGTGCTTTG 
TACCTATATC 
TCTCTGTGTG 



51 
I 

CTCGTCGCAC 
AAGGCTCAGC 
GCCGGGTTGC 
TCCGTCCAAA 
ACCAGGCCCG 
CAGCCTGCCC 
CGCTGTGCCT 
GCACCGCGCT 
GGTCAAGTCG 
GCCGCCGCCA 
GAAGCCGCGC 
GAAACAGTAC 
GACCCAGGTC 
GGAACTGGAA 
CTTCCCCATC 
TAGACCTGTG 
GTACCACCTG 
GGGTTTCCTC 
TCGACGTTGC 
ATTTCGTTCT 
CCCCTTTTTT 



TATATATACG 

TTAAAAGGCA 

ACACATGTGT 

CCGCGTCAAG 

ATGTGCTAAA 

CCAAGGAGTC 

GAAAACAGAC 

TGCTGACCGG 

ACCCACATCA 

AGACCGCCTC 

TGTTCCTATC 

ATTCGCCGCC 

TTTTGTAAGA 

ATCGGAAGCC 

AAAATTAGAA 

CGTCAGAAAC 

AGAGTAAAAT 

GTCAAAAGGT 

GGTGGGGGTT 

TAAATATTCT 

TGTACCTGTT 
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GTATGTCCTA ACTTATTAGA AAAATTTTAT ATCTTTTTAC ATGTGGGGGG CAGAAGGTAA 2 580 

AGCATGTTTG ACTTGTGAAA ATGGGATGTC AAACAGCCAT AAGTTCCCTG GT ATTCA CCT 2640 

TCCTGTCCAT CTGTCCCCTC CATCGGTATA CCTTTATCCC TTTGAAAGGG TGCTTG TACA 2700 

ATTTGATATA TTTTATTGAA GAGTTATCTC TTATTCTGAA TTAAATTAAG CATTTGTTTT 2760 
ATTCTGAATT AAATTAAGCA TTTGTTTTAT TGCAGTAAAG TTTGTCCA 

Seq ID NO i 68 Protein Sequence 
Protein Accession fl: NP_002440.1 

1 U 21 31 41 51 

MASPSKGNDL FSPDEEGPAV VAGPGPGLGG AAGAAEERRV KVSSLPFSVE ALMSDKKPPK 60 

ESPAVPPEGA SAGAHLRPLL LSGKRAREAH SPGPLVKPFE TASVKSGNSE DGAAWMQEPG 120 

RYSPPPRHMS PTTCTLRKHK TNRKPRTPFT TSQLLALERK FRQKQYLSIA ERAEFSSSLN 180 

LTETQVKIWP QNRSAKAKRL QEAELEKLKM AAKPMLPSSF SLPFP1SSPL QAASIYAASY 240 
PFHRPVLPIP PVGLYATPVG YGMYHLS 



Seq ID NO i 69 DNA sequence 

Nucleic Acid Accession fl: NM_005603.l 

Coding sequence: 1..3756 



11 



21 



I 



ATGAGTACAG 
GTGGTTCCCT 
GAACCAGAAC 
GAATGTACAT 
AACACAAAAT 
AAGTACAACG 
AATTTATATT 
TGGTACACCA 
GTGGACGATG 
ATTAAGGATG 
CGTCTGAAAA 
AACAGCCTCT 
ATGTCACTTG 
GGTTTTATTG 
TGGAGAAACA 
AGGAACACCG 
AAGAATAGTG 
GTTTACACGA 
TATTGGGAAG 
CCCTCCTACC 
CCCATCTCTC 
TGGGACCTGC 
CTCAATGAAC 
CAAAATATCA 
GATGCCTCTC 
GCTGATGGGA 
GAGCCAGAAG 
AGGACTGATG 
GCTGCCAGGA 
GAACTGGGCA 
AAGCGAATGT 
GCTGACACTG 
GATGCCCTGG 
ATTGAAGAAA 
ACCAACCGGG 
CTGGGAGCTA 
CTTGCAAAAG 
AATATAGGAT 
ATTAATTCTC 
AAGTTTGCAC 
ATCACTGGTT 
CTGAAGCTGA 
AGGCTAGAAG 
AGCGCAGTCA 
AAGAGGTACA 
ATCAAAACTG 
TCGAGTGACT 
CGATGGTCTT 
TTTACTTTGG 
GAGGATTGGT 
GGGCTGCTCG 
GTGGGACAAA 
GTCCTAACAT 
CAGGATGGAG 



AAAGAGACTC 
ACAGTGATGA 
AAAACCGAGT 
GGCAAGTCAA 
TCTTGTGTAT 
CATTTACCTT 
TCCTGGCTCT 
CACTAGTGCC 
TGGCTCGCCA 
GCAGGTTCAA 
AAAATGATTT 
GCTATGTGGA 
AAATCACAGA 
AATGTGAAGA 
CAAGTTTTCC 
ATTTCTGCCA 
GGAAAACCAG 
TCTTTGTTGT 
CACAGGTGGG 
GTGGATTCCT 
TCTATGTCAG 
AAATGTACTA 
AGCTCGGGCA 
TGACCTTTAA 
AACACAACCA 
AGCTTGCATT 
TACGACAGTT 
GTCAGCTCAA 
ACTTTGGCTT 
CTGAAAGGAC 
CTATCATTGT 
TTATTTATGA 
ATATCTTTGC 
AAGAATTTAC 
ACGAAGCTCT 
CAGCTATTGA 
CTGACATTAA 
TTGCTTGTGA 
TTCTTCATGC 
CTCCTGTGCA 
CTTGGTTGAA 
AGTTCCCAAG 
CTAAGAAAGA 
TCTGCTGCCG 
AGAAAGCCAT 
CCCACATTGG 
ATTCCTTTGC 
ACATAAGGAT 
TTCATTTCTG 
TCATCACCCT 
ACCAGGATGT 
GAGACTTACT 
CGATGATCCT 
AGGCACCTTC 



AGAAACGACA 
TGAAACAGAA 
CAACAGGGAA 
AGCAAACGAT 
TAAGGAGAGT 
TATACCAATG 
TCTTATCTTA 
CCTGCTTGTG 
TAAAATGGAT 
AGTTGCTAAG 
TGTTCCAGCT 
AACAGCAGAA 
CCAGTACCTC 
ACCCAATAAC 
TTTGGATGCT 
CGGCTTAGTC 
ATTTAAAAGA 
TCTTATTCTG 
CAATTCCTCT 
CATTTTCTGG 
CGTGGAAGTG 
TGCTGAGAAG 
GATCCATTAT 
AAAGTGCTGT 
CAACAAAATA 
TTATGACCAC 
CTTCTTCTTG 
CTACCAGGCA 
TGCCTTCCTC 
TTACAATGTT 
AAGAACCCCA 
ACGGTTACAT 
AAATGAAACT 
AGAATGGAAT 
GGATAAAGTA 
AGACAAGCTA 
GATCTGGGTG 
ACTTCTGACT 
AAGGATGGAA 
GGAATCTTTT 
TGAAATTCTT 
AACAGAAGAA 
GCAGCGGCAG 
CGTCACCCCC 
CACGCTGGCC 
CGTTGGAATA 
TCAGTTCCGA 
GTGCAAGTTC 
GTACTCCTTC 
CTACAACGTG 
GAGTGACAAA 
ATTCAACTAT 
CTTCTTCATA 
CGACTACCAG 



31 
I 

TTTGACGAGG 
GATGAACTTG 
GCAGAGGAGA 
CGCAAGTACC 
AAATATGCGA 
AATCTGTTTG 
CAGGCAGTTC 
GTGCTGGGCG 
AAGGAAATCA 
TGGAAAGAAA 
GACATTCTCC 
CTGGACGGAG 
CAAAGAGAAG 
CGACTAGATA 
GATAAAATTT 
ATTTTTGCAG 
ACTAAAATTG 
CTTTCTGCTG 
TGGTACCTCT 
GGCTATATCA 
ATTCGTCTTG 
GACACACCCG 
ATCTTCTCTG 
ATCAACGGGC 
GAGCAAGTTG 
TATCTTATTG 
CTCGCAGTTT 
GCCTCTCCCG 
GCCAGGACCC 
CTTGCCATTT 
GAAGGCAATA 
CGAATGAATC 
CTTAGAACCC 
AAAAAGTTTA 
TATGAGGAGA 
CAGGATGGAG 
CTTACTGGAG 
GAAGACACCA 
AACCAGAGGA 
TTTCCACCCG 
CTCGAGAAAA 
GAAAGACGGA 
AAAAACTTTG 
AAGCAGAAGG 
ATCGGAGATG 
AGTGGACAAG 
TATCTGCAGA 
CTACGATACT 
TTCAATGGCT 
CTGTACACCA 
CTGAGCCTCC 
AAGAGATTCT 
CCTCTTGGAG 
TCTTTTGCCG 



41 
I 

ATTCTCAGCC 
ATGACCAGGG 
ACCGGGAGCC 
ACGAACAACC 
ATAATGCAAT 
AGCAGTTTAA 
CTCAAATCTC 
TCACTGCAAT 
ACAATAGGAC 
TTCAAGTTGG 
TGCTGTCTAG 
AAACCAATTT 
ATACATTGGC 
AGTTTACAGG 
TGTTACGTGG 
GTGCTGACAC 
ATTACTTGAT 
GTCTTGCCAT 
ATGATGGAGA 
TTGTTCTCAA 
GACAGAGTCA 
CAAAAGCTAG 
ATAAGACGGG 
AGATATATGG 
ATTTTAGCTG 
AGCAAATCCA 
GCCACACAGT 
ATGAAGGTGC 
AGAACACCAT 
TGGACTTCAA 
TCAAGCTTTA 
CTACTAAGCA 
TATGCCTTTG 
TGGCTGCCAG 
TTGAAAAAGA 
TTCCAGAAAC 
ACAAAAAGGA 
CCATCTGCTA 
ATAGAGGTGG 
GTGGAAACCG 
AGACCAAGAG 
TGCGGACCCA 
TGGACCTGGC 
CCATGGTGGT 
GGGCCAATGA 
AAGGAATGCA 
GGCTACTGCT 
TCTTTTACAA 
ACTCTGCGCA 
GCCTGCCCGT 
GATTCCCTGG 
TTGTAAGCTT 
CTTATCTGCA 
TCACCATTGC 



51 



TAATGACGAA 
GTCTGCTGTT 
ATTCAGAAAA 
TCACTTTATG 
TAAAACATAC 
GAGAGCAGCC 
TACCCTGGCT 
CAAAGACCTG 
GTGTGAAGTC 
AGACGTCATT 
CTCTGAGCCT 
AAAATTTAAG 
TACATTTGAT 
AACACTATTT 
CTGTGTAATT 
TAAAATAATG 
GAACTACATG 
CGGCCATGCT 
AGACGATACA 
CACCATGGTA 
CTTCATCAAC 
AACCACCACA 
GACACTCACA 
GGACCATCGG 
GAATACATAT 
GTCAGGGAAA 
CATGGTGGAT 
CCTGGTAAAC 
CACCATCAGT 
CAGTGACCGG 
CTGTAAAGGT 
AGAAACACAG 
CTACAAGGAA 
TGTGGCCTCC 
CTTAATTCTC 
CATTTCAAAA 
AACTGCTGAA 
TGGGGAGGAT 
CGTCTACGCA 
TGCCTTAATC 
AAATAAGATT 
AAGTAAAAGG 
CTGCGAGTGC 
GGACCTGGTG 
CGTGAACATG 
AGCTGTCATG 
GGTGCATGGC 
AAACTTTGCC 
GACTGCATAC 
GCTCCTCATG 
GTTATACATA 
GTTGCATGGG 
AACCGTAGGG 
CTCTGCTCTT 
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GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300 

TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 

GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 

AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTACCCGTC 3480 

GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GGCGTGTCAA CGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720 
GGCACCGCGG AGTACAGGCG CACCGGGGAC AGCTGA 



Seq ID NO: 70 Protein Sequence 
Protein Accession ft: NPJ)05594.1 

1 11 21 31 41 51 

MSTERDSETT PDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPPRK 60 

ECTWQVKAND RKYHEQPHFM NTKFLCIKES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 120 

NLYFLALLIL QAVPOISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180 

I KDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSI*CYVETAE LDGETNLKFK 240 

MSLEITDQYk QREDTLATFD GFIECEEPNN RLDKFTGTLF WRNTSFPLDA DKIUiRGCVI 300 

RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 360 

YWEAQVGNSS WYLYDGEDPT PSYRGFLIFW GYIIVLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480 

DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGAXjVN AARNFGFAFL ARTQNTITIS ELGTERTYNV LAILDFNSDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANET LRTLCLCYKE 660 

IEEKEFTEWN KKFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 720 , 

LAKADIKIWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLHARME NQRNRGGVYA 780 

KFAPPVQESF FPPGGNRALI ITGSWLNEIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMWDLV KRYKKAITLA IGDGANDVNM 900 

IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNPA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VliTSMILFFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 1080 

VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVFRR 1200 
GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 

Seq ID NO: 71 DNA sequence 

Nucleic Acid Accession #: NM_136784.1 

Coding sequence: 3 34-816 

1 11 f j 1 I' f 

GGGGATACGG AAAACAGAAT AGAAGCACAG AACAAGGACT ATTGTATATA AGAGAAGCAG 60 

GTGTTAATGT AAGTTGCTGT GGAATGACTG TTTTCAAGAA GAGAGAAGGA GCCAATATAT 120 

CTACTTTCAT TAAGAAGCTG AAATTAAATT TGGATTTCCT TCTTTCCAAA GCTGAGACAC 180 

GATGGCCAAA AAGCGATTGC TGTGATTGGA GCTGGAATTA GCGGACTGGG GGCCATCAAG 240 

TGCTGCCTGG ATGAAGATCT GGAGCCCACC TGCTTTGAAA GAAATGATGA TATTGGACAT 300 

CTCTGGAAAT TTCAAAAAAA TACTTCAGAG AAAATGCCTA GTATCTACAA ATCTGTGACC 360 

ATCAATACTT CCAAGGAGAT GATGTGCTTC AGTGACTTCC CTGTCCCTGA TCATTTTCCC 420 

AACTACATGC ACAACTCCAA ACTCATGGAC TACTTCGGGA TGTATGCCAC ACACTTTGGC 480 

CTCCTGAATT ACATTCGTTT TAAGACTGAA GTGCAAAGTG TGAGGAAGCA CCCAGATTTT 540 

TCTATCAATG GACAATGGGA TGTTGTTGTG GAGACTGAAG AGAAACAAGA GACTTTGGTC 600 
TTTGATGGGG TCTTAGTTTG CAGTGGACAC CACACAGATC CCTACTTACC ACTTCAGTCC 
TTCCCAGGCA TTGAGAAATT TGAAGGCTGT TATTTCCATA GTCGGGAATA CAAAAGTCCC 

GAGGACTTTT CAGGGAAAAG AATCATAGTG ATCGGCATTG GAAATTCTGG AGTGGATATT 780 

QCGGTGGAGC TCAGTCGTGT AGCAAAACAG ATTTGACCAG TAGGAACTGA GATTCAACGT 840 

CCCCAAGACT GCAAAAGTAC CTACAATCAA ACAAATTAAT ATATTTCCTA ATTGGATCAG 900 

CATGCAGACA AGTTTACAAA TATATTACAG AAAGATTTGA AGAGGGATTC TGGTTACGTT 960 

CAGTTAAGCA GGACACTAAT CCTGAACAAT CTGATGTATT AAAGTTTCTA ATGTTTGTAT .020 
CTTTGGTTTG ATGCATACAA TAGAGTGGAA GTCTGTAGTA GTTAATATGA TGATAATTOG 
CTTCTAAATT TAAATCCTTA TTTATATATG AAGCATAGCT ATGTCAATGT CCAGGGAAGC 

ATTTTTAAAT AGTAAATCAG AAACATGTAA ACTAGTGAAT ACAATTTTCT ATTTTTCATT 1200 

CAGAAATTTG TTGTCATAGA TGCAGCCTTT ATCTGGAAAT CAGTGGGCAG CTC^GTCAA 1260 

GCACAGCATG GCGACCACTG TGTAGTATTT TACCACCTAA TCCTTAGTAG CCAACACTGT 1320 

TAAATCTGCA TAATGGGTAG CATATGAATA AAATAATATC TTGTAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAA 

Seq ID NOt 72 Protein sequence: 
Protein Accession 8: NP_620139.1 

U 21 31 |1 51 

ipSIYKSVTI NTSKEMMCFS DFPVPDHFPN YMHNSKLMDY FGMYATHFGL LNYIRFKTEV 60 
QSVRKHPDFS INGQWDVWE TEEKQETLVF DGVLVCSGHH TDPYLPLQSF PGIEKFEGCY 120 
FHSREYKSPE DFSGKRIIVI GIGNSGVDIA VELSRVAKQI 
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PCTAJS02/21338 



Seq ID NO: 73 dna sequence 

Nucleic Acid Accession ft: NMJ)03661.l 

Coding sequence i 1..1152 
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ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAO 
GGATTCGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGCATG 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 

I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 
CTGGCTCTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
TTGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AGGAGAAGCT 
GA 



21 

I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
ATCCAACTTT 
CATCCGTGCC 
ACGCCCCCGG 
TGAACCCAGC 
CTTCTTTCTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
ATCTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
•CTCAGACGAG 
GTCACTGAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTG 
AACAATAATT 



41 
I 

AAGCTGGAGC 
GTAAGCCCCT 
AGGATGCCAT 
CTGATAATGA 
ATGAGCTCCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAGCAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



SI 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
GGAGGTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 
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Seq ID NO: 74 Protein Sequence 
Protein Accession *h NP_003652.l 



l 
I 

MSALFLGVGV 
KEKVSTQNLL 
YRNWFLKEPP 
LAP FT EGG SL 
EFLGENISNF 
EQVERVNEPS 
QELEEKLNIL 



11 

I 

RAEEAGARVQ 
LLLTDNEAWN 
RLKSELEDNI 
VLLEPGMELG 
LSLAGNTYQL 
ILEMSRGVKL 
NNNYKILQAD 



21 
I 

QNVPSGTDTG 
GFVAAAELPR 
RRLRALADGV 
ITAALTGITS 
TRGIGKDIRA 
TDVAPVSFFL 
QEL 



31 
I 

DPQSKPLGDW 
NEADELRKAL 
QKVHKGTTIA 
STMDYGKKWW 
LRRARANLQS 
VLDWYLVYE 



41 
I 

AAGTMDPESS 
DNLARQMIMK 
NWSGSLSIS 
TQAQAHDLVI 
VPHASASRPR 
SKHLHEGAKS 



51 

I 

IFIEDAIKVF 
DKNWHDKGQQ 
SGILTLVGMG 
KSLDKLKEVR 
VTEPISAESG 
ETAEELKKVA 



Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession ft: NM_014452.1 

Coding sequence: 1..196B 



11 



ATGGGGACCT 
GCCACAGCCA 
GCTCAGCCAG 
ACCGGCCAGG 
ACCAACACAA 
AATGGCATAG 
TTACCTTGTG 
AACGCTACCT 
ACAGAGACTG 
TCTAGTGTGA 
AAGCCGGGGA 
ACCTCACCTT 
GTCCCTTCCT 
TCTGTTAGAC 
TCAGCAAGGG 
CAGCAAGGCC 
GGCGAGAAGT 
CACAAGCATT 
GTGCTTGTGG 
CCCCGGCAGG 
ACCCAGAACC 
CTTGTAGCAG 
AGTGAGAGGG 
GCAGCTCTGC 
GCCCTGCGCC 
ACCACCCAGC 
AGCCCCATCC 
TCCCCACAGG 
GACTCTACAT 
AAGAAGGACA 
GATGACATGC 
GCTGAGGACA 



CTCCGAGCAG 
CGATGATCGC 
AACAGAAGGC 
TGCTAACCTG 
GCCTGCGCGT 
AGAAATGCCA 
CTGCCTTGAC 
GTGCCCCCCA 
AGGATGTGCG 
TGAAATGCAA 
CCAAGGAGAC 
CCCCTGGCAC 
CCACTTATGT 
CAAAGGTACT 
GGAAGGAAGA 
CCCACCACAG 
CCAGCACGCC 
TTGACATCAA 
TGATTGTGGT 
ATCCCAGTGC 
GGGAGAAATG 
CCCAAGTGGG 
AGGTTGCTGC 
AGCACTGGAC 
AGCACCGGAG 
TGGAAACTGA 
CCAGCCCCAA 
ACAAGAACAA 
CCAGCGGCTC 
CAGTGTTGCG 
TCCACTTTCT 
AACTAGACCG 



21 
1 

CAGCACCGCC 
GGGCTCCCTT 
CTCGAATCTC 
TGACAAGTGT 
CTGCAGCAGT 
TGACTGTAGT 
TGACCGAGAA 
TACGGTGTGT 
GTGTAAGCAG 
AGCATACACA 
AGACAA CGTC 
AGCCATCTTT 
TCCCAAAGGC 
GAGTAGCATC 
CGTGAACAAG 
ACACATCCTG 
CATCAAGGGC 
TGAGCATTTG 
GTGCAGTATC 
CATTGTGGAA 
GAT CT ACT AC 
AAGCCAGTGG 
TTTCTCCAAT 
CATCCGGGGC 
AAACGATGTT 
CAAACTAGCT 
CGCGAAACTT 
GGGCTTCTTC 
CTCCGCGCTa 
GCAGGTACGC 
AAATCCTGAG 
GCTATTCGAA 



31 



CTCGCCTCCT 
CTCCTGCTTG 
ATTGGCACAT 
CCAGCAGGAA 
TGCCCTGTGG 
CAGCCATGCC 
TGCACTTGCC 
CCTGTGGGTT 
TGTGCTCGGG 
GACTGTCTGA 
TGTGGCACAC 
CCACGCCCTG 
ATGAACTCAA 
CAGGAAGGGA 
ACCCTCCCAA 
AAGCTGCTGC 
CCCAAGAGGG 
CCCTGGATGA 
CGGAAAAGCT 
AAGGCAGGGC 
TGCAATGGCC 
AAAGATATCT 
GGGTACACAG 
CCCGAGGCCA 
GTGGAGAAGA 
CTCCCGATGA 
GAGAATTCCG 
GTGGATGAGT 
AGCAGGAACG 
CTGGACCCCT 
GAGCTGCGGG 
ATTATTGGAG 



41 

I 

GCAGCCGCAT 
GATTCCTTAG 
ACCGCCATGT 
CCTATGTCTC 
GGACCTTTAC 
CATGGCCAAT 
CACCTGGCAT 
GGGGTGTGCG 
GTACCTTCTC 
GTCAGAACCT 
TCCCGTCCTT 
AGCACATGGA 
CAGAATCCAA 
CAGTCCCTGA 
ACCTTCAGGT 
CGTCCATGGA 
GACATCCTAG 
TTGTGCTTTT 
CGAGGACTCT 
TGAAGAAATC 
ATGGTATCGA 
ATCAGTTTCT 
CCGACCACGA 
GCCTCGCCCA 
TTCGTGGGCT 
GCCCCAGCCC 
CTCTCCTGAC 
CGGAGCCCCT 
GTTCCTTTAT 
GTGACTTGCA 
TGATTGAAGA 
TCAAGAGCCA 



60 

120 

180 

240 

300 

3 60 



51 
I 

CGCCCGCCGA 
CACCACCACA 
TGACCGTGCC 
TGAGCATTGT 
CAGGCATGAG 
GATTGAGAAA 
GTTCCAGTCT 
GAAGAAAGGG 
AGATGTGCCT 
GGTGGTGATC 
CTCCAGCTCC 
AACCCATGAA 
CTCTTCTGCC 
CAACACAAGC 
AGTCAACCAC 
GGCCACTGGG 
ACAGAACCTA 
CCTGCTGCTG 
GAAAAAGGGG 
CATGACTCCA 
TATCCTGAAG 
TTGCAATGCC 
GCGGGCCTAC 
GCTAATTAGC 
GATGGAAGAC 
GCTTAGCCCG 
GGTGGAGCCT 
TCTCCGCTGT 
TACCAAAGAA 
GCCTATCTTT 
GATTCCCCAG 
GGAAGCCAGC 
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CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 
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Seq ID NO: 76 Protein Sequence 
Protein Accession fl: NPJ)55267.1 



1 
I 

MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKNKGFF 
DDMLHPLNPE 



11 
I 

LASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQS 
DCLSQNLWI 
MNSTESNSSA 
KLLPSMEATG 
RKSSRTLKKG 
KDIYQFLCNA 
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 

I 

ATATMIAGSL 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
SEREVAAFSN 
TTQLETDKLA 
DSTSSGSSAL 
AEDKLDRLFE 



31 
I 

LLLGFLSTTT 
CPVGTFTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
IiPMSPSPliSP 
SRNGSFITKE 
IIGVKSQEAS 



41 

I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIF 
SARGKEDVNK 
HKHFDIMEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVLRQVR 
QTLLDSVYSH 



51 
I 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWNH 
PWMIVLFLLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPCDIiQPIF 
LPDLL 
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Seq ID NO: 77 DNA sequence 

Nucleic Acid Accession #: NM_003105.3 

Coding sequence i 12 3.. 6767 



11 



GGCGCGGGCG 
TGGCCTCGCG 
ACATGGCGAC 
CACTGCTGCC 
CGCCCTTGCC 
TGTGGGCGCG 
AACGGAGCGC 
ATTCCCACAA 
TGGCCCGAGA 
ACTATGGAAA 
GTGAAGCTGT 
CAGACGCTTA 
TTTCCATCCC 
TGGGCTTTGA 
CCTGGATCAT 
AACCAAATAC 
GTACAGATTT 
TTCAGCTTCG 
AGCAGTCTTC 
TTGTCACAAG 
TTGTGTGTGT 
TGAAGTTCTC 
ACACCTTGGT 
TGCAAGGAGT 
CGGTCATCAC 
GATATGGAGA 
GCCTCAGTCA 
CTCCAGGCCT 
TGTACATCTC 
ACACATGGGG 
AGCTAAAATA 
CAGTGTTTGT 
TTGGCTCGAA 
CCTTGGGAGT 
GGAATGAGTG 
GCTTCAATGG 
AGGACTATGA 
TTCCAGATCC 
CTACTTACAG 
ATGTTGAAGC 
TCATTCTGTA 
AGCAGTTGCC 
ACTGTTTGTA 
GCACAGGGCA 
AACCCCTCAG 
ATCCAGATGG 
CTCTGGTCCT 
CTGGGATTTA 
TGAAGTGGCC 
TGGAGTGCAT 
TCCCGCACCC 
AGCTCAGCAT 
AGCTCACGGG 



21 

I I 
GCCTGGAGCC CCGGGAGCGG 
CTGCACATTC TCTCCTGGCG 
ACGGAGCAGC AGGAGGGAGT 
GCCCGGAGCT CTCTGCGAAG 
CCAGGACCGG GGCTTCCTCG 
CGGGGATGCC AGGGGGGCGA 
TGCCCTGCAG CCCGAGCCCA 
TCAGATGGTG GTGCACTGGG 
TAGCCTGGCA TTGGCGAGGC 
ATCATTCAAG AAAATTTCAG 
TATCGCCCAG TTCTACCACA 
TGCCCAGTAC CTCTGGATCA 
ATTTCGGGCA GCTGATCTCC 
CAGGTCCCAC CCCAACAAGC 
GATTCAGGAA CATGTCAAGT 
CATCTACATT GAACGACACG 
CTTCCAGTCC CGGGAAAACC 
GGACAAGTAC ATGTTTGCTA 
TGTCCAGCTC TGGGTCTCCT 
ACATCCTATT AATGAATATT 
CAGCCACAGT AACAACCGCA 
CCTGTCCTTG GAGAACGTGC 
GAGGTATTTT GCAAATGAAC 
CTACATTGCT ACTCTGATTA 
CTTTGACAAA GGGQGA ACCT 
GAAAATCAAT TGTGAGCTTT 
GCTCCTCAAC CTCCAGCTCC 
CATCATCGCC ACTGGCTCAG 
TAGCAGTGCT GGAGCCAGGT 
AGACCACGGC GGAATCATCA 
CAGTACCAAT GAAGGGGAGA 
GTATGGCCTC CTCACAGAAC 
CAAAGAGAAT GTCCACAGCT 
TCCCTGCACA GAGAATGACT 
TTTGCTGGGA CACAAGACTG 
AGAGGACTTT GACAGGCCGG 
GTGTGACTTC GGTTTCAAGA 
GGAATTTTCT GGAAAGTCAT 
GAGAACGAGA GGCTACCGGA 
GCGACTGGAA GGAGAGCTGG 
TGCTGTGAGG AAATCCATCT 
TCTCACCGGG CTACGGGCAG 
TTGGTCCGAC CTGGCCTTGG 
AGAGGTGATC ATCAATTCTG 
CCAGCTGCTT TACTGGGTAG 
CGACTTCCGA CTCACAATCG 
CGTGCCCCAA GAGGGGGTGA 
TCGGAGCAAT ATGGATGGTT 
CAATGGCATC TCTGTGGACG 
AGAGCGGATC ACGTTCAGTG 
CTATGCCATT GCTGTCTTTA 
ATTCCGAGCT TCCAAATACA 
GCTCATGGAC ATGAAGATTT 



31 



41 



51 



CGCGCGCGGT 
GCGGCGCCAC 
CGCGACTCCC 
TCTGGACGCA 
TGGTGCAGGG 
GCCGCGCGGA 
TCAAGGTGTA 
CTGGAGAGAA 
CCAAGAGCAG 
ACAAGTTAAA 
GCCCTGCGGA 
CGTTTGACTT 
TCCTACACAG 
AGCTGTGGAA 
CCTTTTCTTG 
AACCCTCTGG 
AGGAAGTGAT 
CAAAGGTGGT 
TTGGCCGGAA 
ACATCGCAGA 
CCAATTTATA 
TCTATTACAG 
CATTTGCTGA 
ATGGTTCTAT 
GGGAGTTTCT 
CCCAGGGCTG 
GGAGAATGCC 
TGGGAAAGAA 
GGCGAGAGGC 
CGGCCATTGC 
CCTGGAAAAC 
CTGGGGAGAA 
GGCTGATCCT 
ACAAGCTGTG 
TTTTCAAACG 
TGGTCGTGTC 
TGAGTGAAGA 
ACTCCCCTCC 
AGATTTCTGG 
TCCCCTGTCC 
ACCGCTATGA 
CAGTGGCCCT 
ACGTCATCCA 
GCCTGGAGAC 
ATGCAGGCTT 
TCAATTCCTC 
TGTTCTGGAC 
CTGCTGCCTA 
ACCAGTGGAT 
GCCAGCAGCG 
AGAATGAAAT 
GTGGGTCCCA 
TCTACAAGGG 



CCCGGCCCAG 
CTGCAGTAGC 
GTTCCTATTC 
GAGGCTGCAC 
CGACCCGCGC 
CGAGAAGCCG 
CGGACAGGTT 
AAGCAACGTG 
TGATGTGTAC 
CTTTGGCTTG 
CAACAAGCGG 
CTGCAACACT 
TAAGGCCrCC 
GTCAGATGAC 
GGGAATTGAT 
CTACTCCACT 
CCTTGAGGAA 
GCATCTCTTG 
GCCCATGAGA 
TGCCTCCGAG 
CATCTCAGAG 
CCCAGGAGGG 
CTTCCACCGA 
GAATGAGGAG 
TCAGGCTCCA 
TTCCCTTCAT 
CATCCTGTCC 
CTTGGCTAGC 
ACTTCCTGGA 
CCAGGGCATG 
ATTCATCTTC 
GAGCACTGTC 
CCAGGTCAAT 
GTCACCATCT 
GCGGACCCCC 
CAACTGCTCC 
TTTGTCATTA 
TGTGCCTTGC 
GGACACTTGT 
CCTGGCAGAA 
CCTGGCCTCG 
GGACTTTGAC 
GCGCCTCTGT 
AGTAGAAGCT 
CAAAAAGATT 
TGTGCTTGAT 
AGACTGGGGA 
TCACCTGGTG 
TTACTGGACG 
CTCTGTCATT 
CTACTGGGAT 
GATGGAGATT 
GAAGAACACT 



CCGGCTCTCC 
GTTCGCCCGA 
ACCCTGGTCG 
GGCGGCAGCG 
GAGCTGCGGC 
CTCCGGAGGA 
AGTCTGAATG 
ATCGTGGCCT 
GTGTCTTACG 
GGAAATAGGA 
TACATCTTTG 
CTTCAAGGCT 
AACCTTCTCT 
TTTGGCCAGA 
CCCTATGACA 
GTCTTCCGAA 
GTGAGAGATT 
GGCAGTGAAC 
GCAGCCCAGT 
GACCAGGTGT 
GCAGAGGGGC 
GCCGGCAGTG 
GTGGAAGGAT 
AACATGAGAT 
GCCTTCACGG 
CTGGCTCAGC 
AAGGAGTCGG 
AAGACAAACG 
CCTCACTACT 
GAAACCAACG 
TCTGAGAAGC 
TTCACCATCT 
GCCACGGATG 
GATGAGCGGG 
CATGCCACAT 
TGCACCCGGG 
GAGGTTTGTG 
CCTGTGGGTT 
AGCGGAGGAG 
GAGAACGAGT 
GGAGCCACCG 
TATGAGCACA 
TTGAATGGAA 
TTGGCTTTTG 
GAGGTAGCTA 
CGTCCCAGGG 
GACCTGAAGC 
TCTGAGGATG 
GATGCCTACC 
CTGGACAACC 
GACTGGTCAC 
CTGGCAAACC 
GGAAGCAATG 



60 
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960 
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1200 
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1680 
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1800 

1860 

1920 
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2040 

2100 
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3060 
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258 



WO 03/003906 



CCTGTGTGCC CAGGCCATGC AGCCTGCTGT GCCTGCCCAA GGCCAACAAC AGTAGAAGCT 3240 

GCAGGTGTCC AGAGGATGTO TCCAGCAGTG TGCTTCCATC AGGGGACCTG ATGTGTGACT 3300 

GCCCTCAGGG CTATCAGCTC AAGAACAATA CCTGTGTCAA AGAAGAGAAC ACCTGTCTTC 3360 

GCAACCAGTA TCGCTGCAGC AACGGGAACT GTATCAACAG CATTTGGTGG TGTGACTTTG 3420 

ACAACGACTG TGGAGACATG AGCGATGAGA GAAACTGCCC TACCACCATC TGTGACCTGG 3480 

ACACCCAGTT TCGTTGCCAG GAGTCTGGGA CTTGTATCCC ACTGTCCTAT AAATGTGACC 3540 

TTGAGGATGA CTGTGGAGAC AACAGTGATG AAAGTCATTG TGAAATGCAC CAGTGCCGGA 3600 

GTGACGAGTA CAACTGCAGT TCCGGCATGT GCATCCGCTC CTCCTGGGTA TGTGACGGGG 3660 

ACAACGACTG CAGGGACTGG TCTGATGAAG CCAACTGTAC CGCCATCTAT CACACCTGTG 3720 

AGGCCTCCAA CTTCCAGTGC CGAAACGGGC ACTGCATCCC CCAGCGGTGG GCGTGTGACG 3780 

GGGATACGGA CTGCCAGGAT GGTTCCGATG AGGATCCAGT CAACTGTGAG AAGAAGTGCA 3840 

ATGGATTCCG CTGCCCAAAC GGCACTTGCA TCCCATCCAG CAAACATTGT GATGGTCTGC 3900 

GTGATTGCTC TGATGGCTCC GATGAACAGC ACTGCGAGCC CCTCTGTACG CACTTCATGG 3960 

ACTTTGTGTG TAAGAACCGC CAGCAGTGCC TGTTCCACTC CATGGTCTGT GACGGAATCA 4020 

TCCAGTGCCG CGACGGGTCC GATGAGGATG CGGCGTTTGC AGGATGCTCC CAAGATCCTG 40B0 

AGTTCCACAA GGTATGTGAT GAGTTCGGTT TCCAGTGTCA GAATGGAGTG TGCATCAGTT 4140 

TGATTTGGAA GTGCGACGGG ATGGATGATT GCGGCGATTA TTCTGATGAA GCCAACTGCG 4200 

AAAACCCCAC AGAAGCCCCA AACTGCTCCC GCTACTTCCA GTTTCGGTGT GAGAATGGCC 4260 

ACTGCATCCC CAACAGATGG AAATGTGACA GGGAGAACGA CTGTGGGGAC TGGTCTGATG 4320 

AGAAGGATTG TGGAGATTCA CATATTCTTC CCTTCTCGAC TCCTGGGCCC TCCACGTGTC 4380 

TGCCCAATTA CTACCGCTGC AGCAGTGGGA CCTGCGTGAT GGACACCTGG GTGTGCGACG 4440 

GGTACCGAGA TTGTGCAGAT GGCTCTGACG AGGAAGCCTG CCCCTTGCTT GCAAACGTCA 4500 

CTGCTGCCTC CACTCCCACC CAACTTGGGC GATGTGACCG ATTTGAGTTC GAATGCCACC 4560 

AACCGAAGAC GTGTATTCCC AACTGGAAGC GCTGTGACGG CCACCAAGAT TGCCAGGATG 4620 

GCCGGGACGA GGCCAATTGC CCCACACACA GCACCTTGAC TTGCATGAGC AGGGAGTTCC 4680 

AGTGCGAGGA CGGGGAGGCC TGCATTGTGC TCTCGGAGCG CTGCGACGGC TTCCTGGACT 4740 

GCTOGGACGA GAGCGATGAA AAGGCCTGCA GTGATGAGTT GACTGTGTAC AAAGTACAGA 4800 

ATCTTCAGTG GACAGCTGAC TTCTCTGGGG ATGTGACTTT GACCTGGATG AGGCCCAAAA 4860 

AAATGCCCTC TGCTTCTTGT GTATATAATG TCTACTACAG GGTGGTTGGA GAGAGCATAT 4920 

GGAAGACTCT GGAGACCCAC AGCAATAAGA CAAACACTGT ATTAAAAGTC TTGAAACCAG 4980 

ATACCACGTA TCAGGTTAAA GTACAGGTTC AGTGTCTCAG CAAGGCACAC AACACCAATG 5040 

ACTTTGTGAC CCTGAGGACC CCAGAGGGAT TGCCAGATGC CCCTCGAAAT CTCCAGCTGT 5100 

CACTCCCCAG GGAAGCAGAA GGTGTGATTG TAGGCCACTG GGCTCCTCCC ATCCACACCC 5160 

ATGGCCTCAT CCGTGAGTAC ATTGTAGAAT ACAGCAGGAG TGGTTCCAAG ATGTGGGCCT 5220 

CCCAGAGGGC TGCTAGTAAC TTTACAGAAA TCAAGAACTT ATTGGTCAAC ACTCTATACA 5280 

CCGTCAGAGT GGCTGCGGTG ACTAGTCGTG GAATAGGAAA CTGGAGCGAT TCTAAATCCA 5340 

TTACCACCAT AAAAGGAAAA GTGATCCCAC CACCAGATAT CCACATTGAC AGCTATGGTG 5400 

AAAATTATCT AAGCTTCACC CTGACCATGG AGAGTGATAT CAAGGTGAAT GGCTATGTGG 5460 

TGAACCTTTT CTGGGCATTT GACACCCACA AGCAAGAGAG GAGAACTTTG AACTTCCGAG 5520 

GAAGCATATT GTCACACAAA GTTGGCAATC TGACAGCTCA TACATCCTAT GAGATTTCTG 5580 

CCTGGGCCAA GACTGACTTG GGGGATAGCC CTCTGGCATT TGAGCATGTT ATGACCAGAG 5640 

GGGTTCGCCC ACCTGCACCT AGCCTCAAGG CCAAAGCCAT CAACCAGACT GCAGTGGAAT 5700 

GTACCTGGAC CGGCCCCCGG AATGTGGTTT ATGGTATTTT CTATGCCACG TCCTTTCTTG 5760 

ACCTCTATCG CAACCCGAAG AGCTTGACTA CTTCACTCCA CAACAAGACG GTCATTGTCA 5820 

GTAAGGATGA GCAGTATTTG TTTCTGGTCC GTGTAGTGGT ACCCTACCAG GGGCCATCCT 5880 

CTGACTACGT TGTAGTGAAG ATGATCCCGG ACAGCAGGCT TCCACCCCGT CACCTGCATG 5940 

TGGTTCATAC GGGCAAAACC TCCGTGGTCA TCAAGTGGGA ATCACCGTAT GACTCTCCTG 6000 

ACCAGGACTT GTTGTATGCA ATTGCAGTCA AAGATCTCAT AAGAAAGACT GACAGGAGCT 6060 

ACAAAGTAAA ATCCCGTAAC AGCACTGTGG AATACACCCT TAACAAGTTG GAGCCTGGCG 6120 

GGAAATACCA CATCATTGTC CAACTGGGGA ACATGAGCAA AGATTCCAGC ATAAAAATTA 6180 

CCACAGTTTC ATTATCAGCA CCTGATGCCT TAAAAATCAT AACAGAAAAT GATCATGTTC 6240 

TTCTGTTTTG GAAAAGCCTG GCTTTAAAGG AAAAGCATTT TAATGAAAGC AGGGGCTATG 6300 

AGATACACAT GTTTGATAGT GCCATGAATA TCACAGCTTA CCTTGGGAAT ACTACTGACA 6360 
ATTTCTTTAA AATTTCCAAC CTGAAGATGG GTCATAATTA CACGTTCACC GTCCAAGCAA 
GATGCCTTTT TGGCAACCAG ATCTGTGGGG AGCCTGCCAT CCTGCTGTAC GATGAGCTGG 

GGTCTGGTGC AGATGCATCT GCAACGCAGG CTGCCAGATC TACGGATGTT GCTGCTGTGG 6540 

TGGTGCCCAT CTTATTCCTG ATACTGCTGA GCCTGGGGGT GGGGTTTGCC ATCCTGTACA 6600 

CGAAGCACCG GAGGCTGCAG AGCAGCTTCA CCGCCTTCGC CAACAGCCAC TACAGCTCCA 6660 

GGCTGGGGTC CGCAATCTTC TCCTCTGGGG ATGACCTGGG GGAAGATGAT GAAGATGCCC 6720 

CTATGATAAC TGGATTTTCA GATGACGTCC CCATGGTGAT AGCCTGAAAG AGCTTTCCTC 6780 

ACTAGAAACC AAATGGTGTA AATATTTTAT TTGATAAAGA TAGTTGATGG TTTATTTTAA 6840 

A^GATGCACT TTGAGTTGCA ATATGTTATT TTTATATGGG CCAAAAACAA AAAACAAAAA 6900 
AAAAAA 

Seq ID NO; 78 Protein Sequence 
Protein Accession #: NPJ>03096.1 

1 11 r i 1 i 1 r 

Ltrssrres rlpflftlva llppgalcev wtqrlhggsa plpqdrgflv vqgdprelrl 60 

WARGDARGAS RADEKPLRRK RSAALQPEPI KVYGQVSLND SHNQMWHWA GEKSNVIVAL 120 

ARDSLALARP KSSDVYVSYD YGKSFKKISD KLNFGLGNRS EAVIAQFYHS PADNKRYIFA 180 

DAYAQYLWIT FDFCNTIjQGF SIPFRAADLL LH9KASNLLL GFDRSHPKKQ LWKSDDFGQT 240 

WIMIQEHVKS FSWGIDPYDK PNTI YIERHE PSGYSTVFRS TDFFQSRENQ EVILEEVRDF 300 

QLRDKYMFAT KWHLLGSEQ QSSVQLWVSF GRKPMRAAQF VTRHPINEYY IADASEDQVF 360 

VCVSHSNNRT NLYISEAEGL KFSLSLBNVL YYSPGGAGSD TLVRYFANEP FADFHRVEGL 420 

QGVYIATLIN GSMNEENMRS VITFDKGGTW EFUQAPAFTG YGEKINCELS QGCSI^LAQR 480 

LSQLLNLQLR RMPILSKESA PGLIIATGSV GKNLASKTNV YISSSAGARW REALPGPHYY 540 
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TWGDHGGIIT AIAQGMETMB LKYSTNEGET WKTFIFSEKP VFVYGLLTEP GEKSTVFTIF 600 

GSNKENVHSW LILQVNATDA LGVPCTENDY KLWSPSDERG NECLLGHKTV FKRRTPHATC 660 

FNGEDFDRPV WSNCSCTRE DYECDFGFKM SEDLSLEVCV PDPEFSGKSY SPPVPCPVGS 720 

TYRRTRGYRK ISGDTCSGGD VEARLEGELV PCPLAEENEF ILYAVRKSIY R YD LAS GATE 7B0 

5 QLPLTGLRAA VALDFDYEHN CLYWSDLALD VIQRLCLNGS TGQEVI INSG LETVEALAFE 840 

PLSQLLYWVD AGFKKIEVAN PDGDFRLTIV NSSVLDRPRA LVLVPQEGVM FWTDWGDLKP 900 

GIYRSNMDGS AAYHLVSEDV KWPNGISVDD QWIYWTDAYL ECIERITFSG QQRSVILDNL 960 

PHPYAIAVFK NEIYWDDWSQ LSIFRASKYS GSQMEILANQ LTGLMDMKIF YKGKNTGSNA 1020 

CVPRPCSLLC LPKANNSRSC RCPEDVSSSV LPSGDLMCDC PQGYQLKNNT CVKEENTCLR 1080 

10 NQYRCSNGNC INS I WW CD FD NDCGDMSDER NCPTTICDLD TQFRCQESGT CIPLSYKCDL 1140 

EDDCGDNSDE SHCEMHQCRS DEYNCSSGMC IRSSWVCDGD NDCRDWSDEA NCTAIYHTCE 1200 

ASNFQCRNGH CIPQRWACDG DTDCQDGSDE DPVNCEKKCN GFRCPNGTCI PSSKHCDGLR 1260 

DCSDGSDEQH CEPLCTHFMD FVCKNRQQCL FHSMVCDGII QCRDGSDEDA AFAGCSQDPE 1320 

FHKVCDEFGF QOQNGVCISL IWKCDGMDDC GDYSDEANCE NPTEAPNCSR YFQFRCENGH 1380 

15 CIPNRWKCDR ENDCGDWSDE KDCGDSHILP FSTPGPSTCL PNYYRCSSGT CVMDTWVCDG 1440 

YRDCADGSDE EACPLLANVT AASTPTQLGR CDRFEFECHQ PKTCIFNWKR CDGHQDCQDG 1500 

RDEANCPTHS TLTCMSREFQ CEDGEACIVL SERCDGFLDC SDESDEKACS DELTVYKVQN 1560 

LQWTADPSGD VTLTWMRPKK MPSASCVYNV YYRWGESIW KTLETHSNKT NTVLKVLKPD 1620 

TTYQVKVQVQ CLSKAHNTND FVTLRTPEGL PDAPRNLQLS LPREAEGVIV GHWAPPIHTH 1680 

20 GLIREYIVEY SRSGSKMWAS QRAASNFTEI KNLLVNTLYT VRVAAVTSRG IGNWSDSKSI 1740 

TTIKGKVIPP PDIHIDSYGE NYLSFTLTME SDIKVNGYW NLFWAFDTHK QERRTLNFRG 1800 

SILSHKVGNL TAHTSYEISA WAKTDLGDSP LAFEHVMTRG VRPPAPSLKA KAINQTAVEC 1860 

TWTGPRNWY GIFYATSFLD LYRNPKSLTT SLHNKTVIVS KDEQYLFLVR VWPYQGPSS 1920 

DYWVKMIPD SRLPPRHLHV VHTGKTSWI KWESPYDSPD QDLLYAIAVK DLIRKTDRSY 1980 

25 KVKSRNSTVE YTLNKLEPGG KYHIIVQLGN MSKDSSIKIT TVSLSAPDAL KIITENDHVL 2040 

LFWKSLALKE KHFNESRGYE IHMFDSAMNI TAYLGNTTDN FFKISNLKMG HNYTFTVQAR 2100 

CLFGNQICGB PAILLYDELG SGADASATQA ARSTDVAAW VPILFLILLS LGVGFAILYT 2160 
KHRRLQSSFT AFANSHYSSR LGSAIFSSGD DLGEDDEDAP MITGFSDDVP MVIA 

30 Seq ID NO: 79 DMA sequence 

Nucleic Acid Accession fh Eos sequence 
Coding sequence: 120-755 

1 11 21 31 41 51 

^ CCTGCCACTC CCCTGCCAGT GTGCGAGGCA GCGTGAAGCT GGGGCCTGCT CCCCGCAGCC 60 
TCTGGAGCGC ATCTCAGACC TTCTGAGACC TATGTTGCTG GCCCCCCAGA ACCCGCAACA 120 
TGGCAGATGG GGCAAAGGCC AACCCCAAAG GGTTCAAAAA GAAGGTGCTG GATAGATGCT 180 
TCTCTGGGTG GAGGGGCCCA CGCTTCGGGG CCTCCTGTCC TTCAAGAACC TCCAGGTCTA 240 
40 GCCTGGGTAT GAAGAAGTTC TTCACCGTGG CCATCCTTGC TGGCAGCGTT CTGTCCACAG 300 
CTCACGGCAG CCTGCTCAAC CTGAAGGCCA TGGTGGAGGC CGTCACAGGG AGGAGCGCCA 360 
TCCTGTCCTT CGTGGGCTAC GGTTGCTACT GTGGGCTGGG GGGCCGTGGC CAGCCCAAGG 420 
ATGAGGTGGA CTGGTGCTGC CACGCCCACG ACTGCTGCTA CCAGGAACTC TTTGACCAAG 480 
GCTGTCACCC CTATGTGGAC CACTATGATC ACACCATCGA GAACAACACT GAGATAGTCT 540 
45 GCAGTGACCT CAACAAGACA GAGTGTGACA AGCAGACATG CATGTGTGAC AAGAACATGG 600 
TTCTGTGCCT CATGAACCAG ACGTACCGAG AGGAGTACCG TGGCTTCCTC AATGTCTACT 660 
GCCAGGGCCC CACGCCCAAC TGCAGCATCT ATGAACCGCC CCCTGAGGAG GTCACCTGCA 720 
GTCACCAATC CCCAGCGCCC CCCGCCCCTC CCTAGAGCCT CTGAGGTTTG AGAGAGAGAG 780 
CGGGAGGAGG GTCTGGCTTG GGGACCAGAC GAGGTGCAGG GAGGGTAGGA GCCAGGCCAG 840 
50 GAGCCTGAGG GTTGCTGGTT GCCTCCTCCC TGGAGCTCTC CAGTGAGGGC TCAGCTCTCA 900 
GAGGACTCAG GAAGGCCTGG GTCCTGACTC CCCCAGCCCA GCCCCAGGCA TGGGTGCCTC 960 
CTGCTGCTGG TTCTGGACTG GGTGGGAGGC ACGGAGCTTA TAGGGGTCTC TCCTGAGGGT 
GGCCGGGGAG ACCTGAGAGA GAGGAGGAGG GGCCTCTGAG TGGGGCCTCT GTTGCTGGCG 
CCAGTTTAAC TCCCCGGAGC CTTAGAAAGT CTGAGCCTTA GAAAGTCTGG GCCTGAGCAT 
55 CCAGGCCCAG AGCTGGATGC ATCCTCGGCC CAAGATCACA GGAAGGCAGA TTGCTGGTCA 

CAAAGCATGG GTTCCGGGAG CCCCTCAGCT GATCCCACAG GATGGCCTGG ^TGGTCGCT 
ACTTTGGGCT TGAAGCTCTC TAGAGCCCCA TTCACAGAGC AGGTGCCCCA CCCCAGCCCT 
CACTGAGGTG GCCATGAGTC CAGGTTCAGA CCCTAGGGGA TGTGCAGTGC TGAGTGTCCC 
AGCCCTGAAG TCTAGCCTCA GAGCAGGCTC CTGGGCACTG ACAGAGCCAC ACACCCAGGT 
60 ACACCCCCCA GGCTAATATG GGGACACACA AAAATATGGC CTTCAACACC TACAAGTCTT 
CTCTGTGTCC CCCATCGACC CATGCACATC ACCCCAACCC TGCTCTGCCA CACCACGTGG 
GTCTGAGGAG CACTCCTACC CCCCGGCTCC GGGTCCCTGA CAGACACTGC CCTCCTGGCC 
TGCACCTGGA ATGGCAGCCA GAACACTGGA GGGGGACCCA CACCCTCTCT CCCTGGGCTG 
TGGATGGGTG TTGCAAGTCC CATCCCTCCT GGCCTGCACC TGGAACAGCA GCCAGAACAC 
65 CGGAGGGGGA CCCACACCCT CTCTCCCTGG GCTGTGGGCA GGCGTCACAA GTCCCATTGG 
SS GCTGAGGGCT GAGACCCACC TCAGTGAAGA GGGAAACGTA AGCAGGGAGA 
GCATCAAGGC GGGAGGCAGC CTGGAAACTT CTAGAAGAGG GTCAAGGTAG GAGGCTACAT 
GGAAGCTTCT AGAAGGCAGT AAGGAACAGG GTGGTGAAGG AGAGGGGAGA AGATGGGAGC 
ATGGAGAGAC GAGGTGAGTG ACAGCCACCA GCAGGGACGG GCCTCCAGGC AACACAGAGG 
70 AGCTGGGGGC CAGGGCCTTA CGGGGCTGGG TTCCTGTGTG GGGCGTGGAG CTG^GCTGA 
AGTGGAGGGG ACGGCCCTGC GGCCCCCAGC ACTGTTGTTT CTCCAGGTGC ACCTTACCTT 
CCTCCTTACC TGCCCGAGCC TCAACTCAGG GTCAGCAGCC TCCACGGAGC CAGCCCCACC 
TGCCCGAAAA CCACTGAAGC CACCCTCCGC CCGACGTCAA CGGGGATCCA CTAGTTTAGA 
GCGCGGCCCC CGCGTGC 
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1 11 21 31 41 51 

I I I I I I 

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFPTV AILAGSVLST 60 
AHGSLLNLKA MVEAVTCRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELPDQ 120 
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLNVY ISO 
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO: Bl DNA sequence 

Nucleic Acid Accession fh NM_018136.1 

Coding sequence: 38.. 2218 

1 11 21 31 41 51 

GAAATCTGTA CGAACTATTC AGGCTGCTTT TAGAGGCATG AAAGTTAGAC AAAAATTGAA 60 
AAATGTATCA GAGGAAAAGA TGGCAGCCAT .TGTTAACCAA TCTGCACTCT GCTGTTACAG 120 
AAGCAAAACT CAGTATGAAG CTGTTCAAAG TGAAGGTGTT ATGATTCAAG AGTGGTATAA 180 
AGCTTCTGGC CTTGCTTGTT CACAGGAAGC AGAGTATCAT TCTCAAAGTA GGGCTGCAGT 240 
AACAATTCAA AAAGCTTTTT GTAGAATGGT CACAAGAAAA CTGGAAACAC AGAAATGTGC 300 
TGCCCTACGG ATTCAGTTCT TCCTTCAGAT GGCTGTGTAT CGGAGAAGAT TTGTTCAGCA 360 
GAAAAGAGCT GCTATCACTT TACAGCATTA TTTTAGGACG TGGCAAACCA GAAAACAGTT 420 
TTTACTATAT AGAAAAGCAG CAGTGGTTTT ACAAAATCAC TACAGAGCAT TTCTGTCTGC 480 
AAAACATCAA AGACAAGTCT ATTTACAGAT CAGAAGCAGT GTTATCATTA TTCAAGCTAG 540 
AAGTAAAGGA TTTATACAGA AACGGAAGTT TCAGGAAATT AAAAATAGCA CCATAAAAAT 600 
TCAGGCTATG TGGAGGAGAT ATAGAGCCAA GAAATATTTA TGTAAAGTGA AAGCTGCCTG 660 
CAAGATTCAA GCCTGGTATA GATGTTGGAG AGCACACAAA GAATATCTAG CTGTATTAAA 720 
AGCTGTTAAA ATTATTCAAG GTTGCTTCTA TACCAAACTA GAGAGAACAC GGTTTTTGAA 780 
TGTGAGAGCA TCAGCAATTA TCATTCAGAG AAAATGGAGA GCTATACTTC CTGCAAAGAT 840 
AGCTCATGAA CACTTCTTAA TGATAAAAAG ACATCGAGCT GCTTGTTTGA TCCAAGCACA 900 
TTATAGAGGA TATAAAGGAA GGCAGGTCTC TCTTCGGCAG AAATCTGCTG CTTTGATCAT 
ACAAAAATAT ATACGAGCCA GGGAGGCTGG AAAGCATGAA AGGATAAAAT ATATTGAATT 
TAAAAAATCT ACAGTTATCC TACAAGCACT GGTGCGTGGT TGGCTAGTAC GAAAAAGATT _ 
TTTAGAACAG AGAGCCAAAA TTCGACTTCT TCACTTCACT GCAGCTGCAT ATTATCACCT 1140 
GAATGCTGTT AGAATTCAAA GAGCCTATAA ACTTTACCTG GCTGTGAAGA ATGCTAACAA 1200 
GCAGGTTAAT TCAGTCATCT GTATTCAGAG ATGGTTTCGA GCAAGATTAC AAGAAAAGAG 12 60 
ATTTATTCAG AAATATCATA GCATCAAAAA GATTGAGCAT GAAGGTCAAG AATGTCTGAG 
CCAGCGAAAT AGGGCTGCAT CAGTAATACA GAAAGCAGTG CGCCATTTTC TCCTCCGTAA 
AAAGCAGGAA AAATTCACTA GTGGAATCAT TAAAATTCAG GCATTATGGA GAGGCTATTC 
TTGGAGGAAG AAAAATGATT GTACAAAAAT TAAAGCTATA CGACTAAGTC TTCAAGTTGT 
TAATAGGGAG ATTCGAGAAG AAAACAAACT CTACAAAAGA ACTGCACTTG CACTTCATTA 
CCTTTTGACA TATAAGCACC TTTCTGCCAT TCTTGAGGCC TTAAAACACC TAGAGGTAGT 
TACTAGATTG TCTCCACTTT GTTGTGAGAA CATGGCCCAG AGTGGAGCAA TTTCTAAAAT 
ATTTGTTTTG ATCCGAAGTT GTAATCGCAG TATTCCTTGT ATGGAAGTCA TCAGATATGC 
TGTGCAAGTC TTGCTTAATG TATCTAAGTA TGAGAAAACT ACTTCAGCAG TTTATGATGT 
AGAAAATTGT ATAGATATAC TATTGGAGCT TTTGCAGATA TACCGAGAAA AGCCTGGTAA 
TAAAGTTGCA GACAAAGGCG GAAGCATTTT TACAAAAACT TGTTGTTTGT ^CTATTTT 
ACTGAAGACA ACAAATAGAG CCTCTGATGT ACGAAGTAGG TCCAAAGTTG TTGACCGTAT 
TTACAGTCTC TACAAACTTA CAGCTCATAA ACATAAAATG AATACTGAAA GJ^TACTTTA 
CAAGCAAAAG AAGAATTCTT CTATAAGCAT TCCTTTTATC CCAGAAACAC CTGTAAGGAC 
CAGAATAGTT TCAAGACTTA AGCCAGATTG GGTTTTGAGA AGAGATAACA TGGAAGAAAT 
CACAAATCCC CTGCAAGCTA TTCAAATGGT GATGGATACG CTTGGCATTC CTTATTAGTA 
AATGTAAACA TTTTCAGTAT GTATAGTGTA AAGAAATATT AAAGCCAATC ATGAGTACGT 
AAAGTGATTT TTGCTCTCTG TGTACAACTT TTAAAATCTG ACTTTGTTTT AAAAAAACAT 
AAACTGTTCA TTACATTCTT CATTTTTATC ATTTATAGTT TTATGCATGT AATAAACTAA 
TATGTCATAA GATG 

Seq ID NO: 82 Protein sequence 
Protein Accession #: NP_060606.1 

1 n i 1 " i 1 r 

MKVRQKLKNV SEBKMAAIVN QSALCCYRSK TQYEAVQSEG VMIQEWYKAS GIACSQEAEY 

hsosraavti qkafcrkvtr kletokcaal riqfflqmav yrrrfvqqkr aaitlqhyfr 

HYRAFLSAKH QRQVYLQIRS SVIIIQARSK GFIQKRKFQE 
IKNSTIKIQA MWRRYRAKKY LCKVKAACKI QAWYRCWRAH KEYLAVLKAV KIIQGCFYTK 

^iiqrkw railpakiah ehflmikrhr aacliqahyr ^kgrqvslr 

Q^aLiIQK YIRAREAGKH ERIKYIEFKK STVILQALVR GWIjVRKRFLE ^«"» 

taaayyhlna vriqraykly lavknankqv nsviciqrwf rarlqekrfi QKyhsikkie 
S^eclsqr nraasviqka vrhfllrkkq ekftsgiiki QALWRGYSWR ^ctoka 

IRLSLQWNR EIREENKLYK RTALALHYLL TYKHLSAILE ALKHLEWTR LSPLCCENMA 
OSGAISKIFV LIRSCNRSIP CMEVIRYAVQ VLLNVSKYEK TTSAVYDVEN CIDILLELLQ 

?Se£S adkggsiftk tccllaxllk ttnrasdvrs rskwdriys lyxl™ 

MHTERILYKQ KKNSSISIPF IPETPVRTRI VSRLKPDWVL RRDNMEEITN PLQAIQMVMD 
TUGIPY 

Seq ID NO: 83 DNA sequence 
Nucleic Acid Accession <h NMJ)00612.2 
Coding sequence: 553.. 1095 
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| | | I I i 

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCOGACTCC 60 

CTCCCCCCCT CACOCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

TTCTGTTTCT CTCCOTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 180 

TCTCCCCCTC GCCCTCTCTT CGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA CAACATCTGG CCCGCCCCAG CCOGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 420 

ACATTCGGCC CCCGCGACTC GGCCAGAGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 490 

CCAACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 540 

TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GCCTTCGCCT CGTGCTGCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAG CAGGCCCGCA 720 

AGCOGTGTGA GCCGTOGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CAGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCCGCCAAGT CCGAGAGGGA CGTGTCGACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGCCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCCAA ACGTCACCGT 1020 

CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 1140 

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAAATCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320 
GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 

Seq ID NO: 84 Protein Sequence 
Protein Accession #: NP_000603.1 

1 11 21 31 41 51 

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGP YFSRPASRVS 60 
RRSRGIVEEC CPRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 

QSTQRLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 180 



Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession ftt XM_092643.1 

Coding sequence: 1-5352 



1 11 21 31 41 51 

ATGACTGAAG CCATTACAGA AGCAGCAGTA GCCTCAAGTG AGGAGGTGTC AGAGGAAAGA 60 

GACGACCTGG GGCCTCTTGA ATTACATGAC AGTGGTACCT TTCAACAAGT CGTGAACCTT 120 

CTGGACATCA TTGACAGCGA GTCAGCAAAG ACGGACACAA CAGGGGCAGG CCTTGACATG 180 

CGGAAGACCC TGGCCTCGGT GATAATCATG GAGAAGGCCA CCACTGAGCC TTCTGTAGTG 240 

ATAAACACTC TCATCCGCTG CCTGCAGGTG CCAGAGATTT CCACCCAGCG CAAGCTCAAC 300 

ATTTACAACA TCCTCCAGGA CATCATCCAG CAGGAGGGGG AGCTGGAGGA GCAGTGCGTG 3 60 
CAGAGGCTGG TGGCCATTGC CTCCAAGGAG ATGAGGGAGA TCCCAGAGAT GGAGGGCTAT 
ATGAAGGCAG AGGTGGCCAG CGACACACTG GTGGCTCTGT CCCGAAACCA CTTCAGCTTG 
GTCATGTACG AGCTGCAGCA CCACCTCAAG CCCCTCAACC TCACTGATGA ATTTGTCATC 
ATCACACTGG CCAAGCTGGC CAACGGCAAT GTGTTTGAGT TCATGCCATA CAT GGG CATC 

ACCCTGGCTA CCATATTCAC CATGCTGAGA CTTGCCAATG AAGCCAAGAT ACGCCAGGCG 660 

ATCTGCAGTG CCATGGAGAC CTTCTGTGAG ACGGTGCAGT TTTATCTGAA GCACCTGGAG 720 

GAGAGCGTGT ACCCCGTGAT GACTGAGGAG GAGTTTGCCC TGAAGGTGTT CCCCATGTAT 780 

CGCTACTTCG TGACAGTGTG GCTGAGGCAC TACAACCCCG AGGTGAAGCT GGGGGTGATC 840 

AAGTCCCTGA AGCCCATGCT CGGCCTCCTT CTGCCCAACG ATGACCTGCG GGAGCAGCTC 900 

TACGACTACA TCCCCCTGCT GCTGGCGGAG TACCAGGGCA GTCTGGAGGT CTTG^GCAG 960 

ATCCTGGAAC TGTCAGTCAC CACCAACACC CCTGTCCCCC AAATGCAGCT ACACACCATT 1020 

TTCACAGAAC TGCACGTCCA GGTGTGCAAC AAGGCCCCGG CCCAGCATCA GTACAGCAGC 1080 

CAGAATCTGA TGGAGATGGT GCACTGCTTC GTAGCCCTTG CTCGCTCCTA CTCCAAGGAG 1140 

CTGATGAAGT TCTTCTTCAG CCAGATGGAG ACAAACAAGG AGGCCGTCCG CGTGGGGACT 1200 

CTGAATCTGA TTAGGGCTAT AGTGAGCGCA GATGAGCCCA GGATGAGTAT CAGGGCCATC 1260 

TACCTGGCTA TCCGGGTAGT CAAGAACACC ATCTCTGATA CCCGGTCCAA GGTGAGGATG 1320 

GCTATTCTCC ACATCATTGG GCAGTTGGCT CTCTGTGGCT ACCAGGAGAG AATCAAAGGC 1380 

TGGGGCCTGA AGTACCTGTC TGTGCAGCTG ACCTTATCCA CCTACAAACT GACAAATCGC 1440 

CGGGAGAAGT TTTATCAGAG GGACTTGGAG GAGAGGATGG TCCACAAAGT CACCATGGAC 1500 

ACTGTGAAGA TCATTACCTC TTCTGTCAGT GGGATGACCA CCGAGTTTTG GGTGAGGCTG 1560 

CTGTGCTACA TCATGGAGAC AGACTACGTG GAAGCTTTGA CTCCTATCTG TATCAGCCTC 1620 

ACAAACCTGG CAGAACACCA GCTCCATGGC CAGGATGTGG ATGTCAGCGT GGCTGGCAAG 1680 

AGCAGGCAAG TGGACCTGCC TGCACCTCAG AAGCTGCTGG CCCGTCTCCT GGTCCTGATG 1740 

TCATCACCTT ACAAGGGGGA GGGTCGTGGG ATAGCCATGC TCAACCTCTT GAGGACCCTG 1800 

AGCCAGAGCA TCGCACCCTC CATGGCCGAC ATGTGGGAGC TGGAGATTGC GCTACTGGTC 1860 

CGGTACCTGG AAGAACATAC TGAGTTCACT TGGGATCAGA AAGCCTGGGA AGACAAGCTC 1920 

ATTCAGTTTC TGCGAAACTC CCTCAAGAAG ACCCGGGGGT CTAGCTGGAG CCTGCGCTTG 1980 

AGTAAAGAGC TGAACAACCA GATTGCGAGC TTTGACAGCC CCTCTCTGGA GAAGGGCTTT 2040 

CTGTACCGGG CCTTGGGCTT caccttggcc ACAGGCCTGG AGGCCAGCAA GGTGGAGGTC 2100 

CTGCTGTTGG AGCTGCTGTA CAAGACGGAC TACAGCAATG ACTTTGACAG CGAGGGTGTG 2160 

262 



360 
420 
480 
540 
600 



WO 03/003906 



ATTATGTGCT 
GACTTCGAGG 
GACCATCCCT 
GCCTCCTACT 
ATCATTCACC 
AGTGTTGTGC 
GCCCAGAAGA 
AACCTGGTTT 
AAGCCTTTCT 
ATTTCTCTCC 
CTGAGCTCCC 
CTGCAGGAGA 
GAAAAGGCCG 
• TGTATCCACC 
TGTGATGCCC 
CTTCACGCAA 
TGTAAGGGGG 
GCCAAGGTGG 
TGCGAGAACA 
TTCTTCCTCC 
ATCCTGGTGC 
ATCCTGCTGC 
CTGCCCATGG 
GCCCGGACCA 
AGTGCCACCT 
TGCACCATCC 
CCTGACCTCA 
CCGCCGGTCT 
CTGCAAAGGT 
GTCAAGAGCC 
GACGGCGGGA 
GAGGGCCACA 
AGCTGCCGCA 
CCAGTTCTGT 
GACCAGGAGG 
CTGGGCGCCC 
CCCCTGAGGG 
ATCCTGGCTG 
TGCAGGATCT 
GGAAAGCTGG 
AAGGCCTGGA 
AGATACATCC 
CTTGGGTCTG 
TGGAAGTCCC 
GTTTTCCAGA 
TTCTTGCTAG 
TGCAACTTGG 
TTCCCAGCAT 
GGCCCAAAGG 
GACAGCGTGC 
TGTTCCCAGT 
TGCTATGCTG 
CTGTTTTGTG 
AAACAAGCTT 



TTGGCCPGTG 
AGAGGATCCA 
GGAGGCGGGA 
GCCACCCCCA 
ATTATGTCAG 
AGGTTACCAA 
CGACTCTTAC 
CTCCAGTGCG 
ACTCCACAGA 
AACTCCCAGG 
TGGAGCAGCT 
TGGTGCAGCT 
TGAGCCTCCA 
TAAAGCTGGG 
ATCAAAGAAC 
GCCAGACCTG 
ACCTCCAGAG 
TCTGCATGGA 
CTGGGGCCAT 
AGATGCGGGC 
ACCTGCCGGT 
TGGCGCACCA 
AGAGCCACCT 
TGCTCCACAG 
CCAAGGCTGA 
ACCTTCTCAT 
TCTACACCCT 
TGAAGATGTG 
GCTCTCGAGG 
AGCACCTGGC 
CATTCCTGGA 
GGCAGAGGCT 
TCAGCAGCAC 
ACCAGGAGAA 
AAGACGAGGC 
CCAAGAAGGT 
AGCCCGTGAG 
AGCTCCGGGA 
TCTTCGACAA 
CAAGGGTGGT 
TCCCCCTCAT 
TGGGCTTTGT 
CTGGCTCGGC 
TGGAGCATCC 
CAACCATGTG 
AAACAATGGC 
CAGGAATTAT 
TACGGAATTC 
GTCGGAAACT 
CTAAGGCCAG 
GCAACTGCCA 
TCCAAAATCC 
GAGCAAATGG 
GA 



TGCCCGGGGC 
GGAGTCAGAG 
GACAGTGAAA 
GTTGCTCCTC 
CAGCTGCCAG 
GGCCATCAAC 
CAGCATTATA 
AGCCTTGGCG 
GGAAAACAGT 
AGAGGACAAT 
GATGGAGAGC 
CCTGGAAAAG 
TCTCTATCTC 
GCAGTTTGGC 
CCGCATGGCC 
CTCCTTGTGG 
CACAGATGTG 
GTTTAGCTGC 
GAACCTGCAG 
CAAGGAGCTG 
GGTGGACCAC 
CCACCAGGAG 
GGCAGAGGTG 
CCTGATGGGC 
CATCTGGCGC 
TCAGAAGCTG 
CCTGCTGCAG 
GAAGCTGGTC 
GCGGGTCACT 
ACATACCCTG 
GGGTGTGAGC 
GGCCGAGCTG 
AGCGGTCTGC 
GCTGCTGAAG 
CCTGCGGGTG 
GAAGCAGTAC 
CAACAGCGTG 
AGGGGATGTG 
CGAGAGCGAG 
CGGGATGTCC 
GCTGCACTCC 
GTCCCAGTCT 
CTGTATGGCT 
CTCAGGGCCA 
CTCCATCCTG 
CTATGTTAAA 
TATGAAGCAG 
CCTCCAGGAA 
AGCTTTGTGT 
GCACTGTGCC 
AAGGAATTCA 
AACCCGTGCT 
TGACTTGATT 



CAGGTAAAAA 
CAGTCCTGGC 
AGTGCCCTCA 
AACCTCGTGG 
GACATCTGTC 
AACATCAAGG 
GTGGCGGTCA 
ATGGAGGCCC 
GAGCTGATGG 
GAGTCCATTA 
CTCCTGCAGA 
TGGATCTTGT 
ATGTGGATTT 
ACAATGGTCG 
TCAATGAATG 
GGCCCTTCCA 
GAGAAGATCT 
GATGAGGTGG 
CATGACAAGG 
GAGGACAAGG 
CCAGAGGTGC 
ACCATCCTCA 
TGGCTGGCAG 
CGGCTGCAGT 
CTGGCTGCGG 
GATGAGAATG 
CTTGGAAGCA 
CACACCACTC 
ATCAAGTCCA 
GACGAGCAGG 
CTGCTGGCCA 
GTGCTCAGGG 
GTGGAAATGA 
CCGGCAGCTT 
CTGTCCCTGC 
CGGAAGGTCT 
ACTGCCGAGG 
GGGTCCTCTT 
CTGCTGCGTC 
AAGAAGCATT 
CAGGACCCCT 
GGGGCCCGCT 
ACCATGTTTC 
AGTGATACCG 
ACTCGGAAAA 
AATAACTTGT 
ATGTCTACAC 
CTACAGCTGG 
CTTCACTOGG 
AAGCGAGCAG 
CCATCCGGGG 
AAAGCCTTCT 
TCCTGCCCAG 



CGGTGCTGAA 
AGATCAGTGC 
TGGTGATGTA 
ACAGCCCCAT 
TCAAAATGGC 
ACCTGGAGGA 
TCAAGGCAGA 
TCTCGCACCT 
ATATCAGCAT 
AGACCCTGTA 
GGCAGCTGGA 
CGGAGAAAGA 
ATGTCCACAG 
GACTCATTGC 
TCCTGTCCAG 
AGCAGAAGGA 
TCTGTGCATC 
TCTCGCTCAT 
CCTCTGTCAC 
TGGCCGAGAT 
GGCGCCTTCT 
CATCGCTCCT 
TGTCGGAGAA 
CACGGCTCAG 
TGGACCCCCT 
ACAAGCTCCC 
GCCACCGACC 
CTCTGCCGGA 
TGCAGCTCTT 
CAGTGTGGGA 
GGCTGTGCAT 
GCATGGACTC 
GGCACCGGTT 
TGCTGCTGGA 
GCGCCCTCGG 
TGCTGGAGAA 
GCATGGAGGC 
TCGACGCCAT 
TGAAAGCCTT 
TCTTCAAAGG 
GCTCCAATGC 
GTTCCCCGAG 
AGTGTGTGCA 
CTACTGATGA 
AGCCGGCTGT 
CAAGAATCAG 
ATTATCTGAA 
ACCCGGATCC 
AGTTCATCCA 
GTAAACCAGG 
TCCCATCAAC 
CTGGAACAGT 
TGATAAAGAG 



TGTGCTTCAT 
TTGGCGGAAG 
TAGCTGCGTG 
CACCGCTAAG 
CTTCATGAAG 
CTTTCACTTT 
ACCGACTGAC 
GAGCAAGCTG 
ACATTCTGTA 
TGCAAATGCC 
CCCCAAGGGG 
ATGGGAGCGG 
CACTGCTGTC 
CCCGTGCACC 
CCTGCTAGAT 
GCTTGAGAAA 
CTCCAGAATC 
CCAGAAGCTC 
CTGGATAGCC 
CCTGAGTGCC 
CATTGACGGC 
GAGGCAGCCA 
CGTGCCCTTC 
CCCCAGAATC 
GATGACCCTG 
GGACTTCCTC 
AGAGGCCGCC 
GGAGATGAAC 
GTTCAAGAGA 
CCTCCTGCAG 
GCAGCACGTG 
AGAAGTCCTG 
CATGAGCGGC 
GAAGGGTGCC 
CAACATGGCC 
GTGCCTGGGC 
CCTGACCAAG 
GTCTGAGCAG 
CATCCTCTTT 
GGAGGTGAAG 
AGCCCAAGTA 
GAAACAGGTC 
CTTCTGGGGC 
CAAGATGACC 
TCTCTACCGC 
AATCGCTGCT 
AAAGCTGGAC 
CGGGCCCACT 
CACGCATGCT 
ACTTCCAAAC 
AGGAGAGAGC 
GGTCTTCAAA 
TATTGCTGCT 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 



Seq ID NO: 86 Protein sequence j 
Protein Accession #: XP_0 92 643.1 



l 

I 

MTEAITEAAV 
RKTLASVIIM 
QRliVAIASKE 
ITLAKIiANGN 
ESVYPVMTEE 
YDYIPLLliAE 
QNLMEMVHCF 
YIAIRWKNT 
REKFYQRDLE 
TNLAEHQLHG 
SQSIAPSMAD 
SKELNNQIAS 
IMCFGLCARG 
ASYCHPQLLIi 
AQKTTLTSII 
ISLQLPGEDN 
EKAVSLHLYL 
LHASQTCSLW 



11 

1 

ASSEEVSEER 
EKATTEPSW 
MREIPEMEGY 
VFEFMPYMGI 
EFALKVFPMY 
YQGSLEVLRQ 
VALARSYPKE 
ISDTRSKVRM 
ERMVHKVTMD 
QDVDVSVAGK 
MWELEIALLV 
FDSPSLEKGF 
QVKTVLNVLH 
NLVDSPITAK 
VAVIKAEPTD 
ESIKTLYANA 
MWIYVHSTAV 
GPSKQKELEK 



21 
I 

DDLGPLELHD 
INTLIRCLQV 
MKAEVASDTL 
TLATIFTMLR 
RYFVTVWLRH 
IbELSVTTNT 
LMKFFFSQME 
AILHUGQLA 
TVKIITSSVS 
SRQVDLPAPQ 
RYLEEHTEFT 
LYRAI/3FTLA 
DFEERIQESE 
IIHHYVSSCQ 
NLVSPVRALA 
LSSLEQLMES 
CIHLKLGQFG 
CKGDLQSTDV 



31 

I 

SGTFQQWNL 
PEISTQRKVN 
VALSRNHFSL 
LANEAKIRQA 
YNPEVKLGVI 
PVPQMQLHTI 
TNKEAVRVGT 
LCGYQERIKG 
GMTTEFWVRL 
KUjARLLVLM 
WDQKAWEDKL 
TGLEASKVEV 
QSWQISAWRK 
DICLKMAFMK 
MEALSHLSKL 
LLQRQLDPKG 
TMVGLIAPCT 
EKIPCASSRI 



41 
I 

LDIIDSESAK 
IYNILQDIIQ 
VMYELQHHLK 
ICSAMETFCE 
KSLKPMLGLL 
FTELHVQVCN 
LNLIRAIVSA 
WGLKYLSVQL 
LCYIMETDYV 
SSPYKGEGRG 
IQFLRNSLKK 
LLLELLYKTD 
DHPWRRETVK 
SWQVTKAIN 
KPFYSTEENS 
LQEMVQLLEK 
CDAHQRTRMA 
AKWCMEPSC 



51 

I 

TDTTGAGLDM 
QEGELEEQCV 
PLNLTDEFVI 
TVQFYLKHLE 
LPNDDLREQV 
KAPAQHQYSS 
DEPRMSIRAI 
TLSTYKLTNR 
EALTPICISL 
IAMLNIiLRTL 
TRGSSWSLRL 
YSNDFDSEGV 
SALMVMYSCV 
NIKDLEDFHF 
ELMDISIHSV 
WILSEKEWER 
SMNVLSSLLD 
DEWSI»IQKL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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CENTGAMNLQ HDKASVTWIA FFLQMRAKEL EDKVAEILSA ILVHLPWDH PEVRRLLIDG 1140 

ILLLAHHHQE TILTSLLRQP LPMESHLAEV WLAVSENVPF ARTMLHSLMG RLQSRLSPRI 1200 

SATSKADIWR LAAVDPLMTL CTIHLLIQKL DENDKLPDFL PDLIYTLLLQ LGSSHRPEAA 1260 

PPVLKMWKLV HTTPLPEEMN LQRCSRGRVT IKSMQLLFKR VKSQHLAHTL DEQAVWDLLQ 1320 

DGGTFIaEQVS LLARLCMQHV EGHRQRLAEL VLRGMDSEVL SCRISSTAVC VEMRHRFMSG 1380 

PVLYQEKLLK PAALLLEKGA DQEEDEALRV LSLRALGNMA LGAPKKVKQY RKVLLEKCLG 1440 

PLREPVSNSV TAEGMEALTK ILAELREGDV GSSFDAMSEQ CRIFFDNESE LLRLKAFILF 1500 

GKLARWGMS KKHFFKGEVK KAWIPLMLHS QDPCSNAAQV RYILGFVSQS GARCSPRKQV 1560 

LGSAGSACMA TMFQCVHFWG WKSLEHPSGP SDTATDDKMT VFQTTMCSIL TRKKPAVLYR 1620 

FLLETMAYVK NNLSRIRIAA CNLAGIIMKQ MSTHYLKKLD FPALRNSLQE LQLDPDPGPT 1680 

GPKGRKLALC LHSEFIHTHA DSVPKARHCA KRAGKPGLPN CSQSNCQRNS PSGVPSTGES 1740 
CYAVQNPTRA KAFSGTWFK LFCGANGDLI SCPVIKSIAA KQA 



Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #; CAT cluster 

1 11 21 31 41 51 

CACTTCTCCC TCTCCTTAGT TCTCCGGTAA CCAAACCTTC TCTCGTTATA TACCCAATGA 60 

TAAAACCAAA GTAATGTTGT ACAGGTTAAA GATTCAAAAC AAACCATAGC AACTGAAACA 120 

GGGTATCAGG ACGCAAATGC TTGGCTGGAA TGGATTAAAC ATTCTGTTGC AGGCTAAACA 180 

AAAGCGCCTG TTACACTTGT GTGACAGGCA GGCCAGAGAC CCAAATTGTT CCCTTTCCAC 240 

TTGGGTGGTC CTCTCACAGA CCGGGCATGA GCTGTGTGGT AGCTCTCTTC CAGAACCCCA 300 

CAGCCTGGGG TGGTGAGTCA TGCAAGACTC TTTCACTGCT GTTCCCTGAG GTCAAGAGCC 360 

CTGCAGGTCA GCCCTGAGGA CTATCCGGCC TCCAGCCCCT GATGTTAACT TCACCTCATG 420 

CCTTTCACGG CAGGGGGAAA AGTTAGCATT CCTTGGAGAC TTAACAGGGT GCAGTGAAAC 480 

CAGGCCTTTT GAAGAGCTTA CCAATCAGTC TGCCCTTGTT CATCCCCGAG CAGATGTGTG 540 

GTGGTATTGC TGGGGACTAC TGCTGGGTAC TCTGCCAAGT AATTAGAGCA GCACTCATGC 600 

TCTAGTCCAA CTGGCCATCC CTTTCACCCT AGCATTCTGT CAATATGATA AAAAAAAATG 660 

TAAAGAAGTG CCTCACGTGG GTCCTTGACC CCCATGTTTA TATAAACACT ATTGGGTTCT 720 
ATGAGGAGAA CCCGATGA 



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..759 

1 11 21 31 41 51 

atgcaggaag aaacaggctg gcagcttggg gaaaggattc ttacatacca cggcgaccag 60 

GTAACTCTGT GCACAGACCC AGAAGAAAGT AGAAAAATAA GTTTTGGAGG AAACCTCATT 120 
GTGAACACAC CTCACCAGGT CAGAACTATC CTAAGTCAGA AAGCAGAAAG GTGGCTTACT 180 
GACTCCAGAA TCTTAAAGTA TGAGGCTATT CTCTTAGAAA AAGATGATTT AACATTAACC 
ACTGATAATT CACTTAACCC AGCAGGTTTC CTAACAGGGG ATCTATATCA AAAGAGAGAG 
CACACATGTT TAGTTTTAAT TAATTACCAT ACAAAGGTCC GACCAGACCT AGGAGAAACT 
CCCTTCAGGA CGGGACAACA CTTTATAGAT GGTTCCTCCC AGGTGATAGA GGGAAAAAGA 
TACAATGGGT ATTCAGTAAT TGATAGAGAA ACTCTTGTAG AAATAGAGTT AGGAAAATTG 
CCTAATAATT GGTCTGCTCA AACATGTGAG CTGTTTGCAC TCAGCCAAGC CTTAAAGGAC 
SISS SSAC CATCTATACA GATTCTAAGT ACG^TTTGG AGTGGCTCAT 
ACATTTGGAA AAATTTGGAC TGAACGAGGT CTTATTAATA GCAAATGCCA AGACTTGGTC 
CATAAGGAAT TAATCACCCA AGTATTAACT AACCTTCAGC TGCCAAAAGA AACAGCTATT 
GACCATGTCC CCGGACACCA AAAAAGCCTT TCTTTTTAA 



240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 89 Protein Sequence 
Protein Accession #: FGENESH predicted 



21 31 41 51 

Leetgwqlg eriltyhgdq vtlctdpees rkisfggnli vntphqvrti ^sqkaerw^ 60 

DSRILKYEAI LLEKDDLTLT TDNSLNPAGF LTGDLYQKRE HTCLVLINYH TKVRPDLGET 
PFRTGQHFID GSSQVIEGKR YNGYSVIDRE tlveielgkl PNNWSAQTCE LFALSQALKD 
LQNQEGTIYT DSKYAFGVAH TFGKIWTERG LINSKCQDLV HKELITQVLT NLQLPKETAI 
DHVPGHQKSL SF 



120 
180 
240 



Seq ID NO: 90 DNA sequence 
Nucleic Acid Accession <h NM__000756.l 
Coding sequence i 186.. 776 

1 11 f I" I' I' 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 
GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CG^GACGTC 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GTCAGTGCCC 
CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 240 
GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 300 
GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 360 
CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 
SS ScSt CCCCTTTCGC CCGCCTCCTC GCTCCTCGCC OUGGCAGCG 
GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 

264 



60 

120 

180 



360 
420 
480 
540 
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TGCTGCTGCC 
ATGCCCTCGG 
TCTCCCTGGA 
AGTTAGCACA 
GTGCGTTTGG 
TCTGTACCAT 
GGGAGAGAGG 
GCAGCAACAC 
TGGTGTTTTA 
AGTCACTCAA 
TAAGACTAAA 
ACATCTTTCC 
TAATAAAAGT 



TCGGCGCTCG 
CGGCCACCAG 
TCTCACCTTC 
GCAAGCTCAC 
CCAAAAAGAA 
AGCGCTGCTC 
GAGAGAGCCT 
AATGTTATTC 
AAGAGAATGT 
TTGTTTTTGT 
TCTGTAAGCT 
AGTCCTTGGT 
TGCAAAGGT 



CTCGACAGCC 
GAGGCACCGG 
CACCTCCTCC 
AGCAACAGGA 
TCTGCATTTA 
TTATGCCATT 
ATACCCCTTA 
GTTTTGTCTA 
AGACCTGTGA 
TGTGGTCTGA 
CTTTGAAACA 
CAGTTTGGTT 



CCGCGGCTCT 
AGAGAGAAAG 
GGGAAGTCTT 
AACTCATGGA 
GCACAAAAAA 
TGTTTATTTT 
CTTAGCATGC 
CGTTTAGTTT 
GAAAACGTTT 
GCCAAAGAGA 
ACTTTCTCTT 
GTGTAAGAGA 



CGCGGAGCGC 
GCGGTCCGAG 
GGAAATGGCC 
GATTATTGGG 
AATTTAAAAA 
TATATAGCTT 
ACAAAGTGTA 
CCGTTTCCAG 
TGTTTGAAAA 
ATGCCATTCT 
GTAAACGTTT 
ATGTTGAATA 



GGCGCTAGGA 
GAGCCTCCCA 
AGGGCCGAGC 
AAATAAAACG 
AATACAGTAT 
GAAACATAGA 
TTCACGTGCA 
GTGTTTATAG 
AGCAGACAGA 
CTTGGGTGGG 
CAGTAATAAA 
CTTATATTTT 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 



Seq ID NO: 91 Protein Sequence 
Protein Accession ftt NP_000747.i 



11 



21 31 41 51 

| | | I I i 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPF PQSEQPQQPQ 60 
ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120 
LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 
AQQAHSNRKL MEIIGK 

Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession ft: NMJJ04217.1 

Coding sequence: 58.. 1092 



GGCCGGGAGA 
GCCCAGAAGG 
AGCACCCTGC 
ATGAGCCGCT 
AGTGGGACAC 
CCTCTGGGCA 
ATCGTGGCGC 
CTGCGCAGAG 
AACTATTTTT 
CTCTACAAGG 
GAGGAGTTGG 
AAGCCAGAAA 
TCTGTGCATG 
CCAGAGATGA 
CTTTGCTATG 
TATCGCCGCA 
GACCTCATCT 
TCAGCCCACC 
TCTGTCGCCT 
TATAGGGGAA 
TTAATAAAGG 



11 
I 

GTAGCAGTGC 
AGAACTCCTA 
CCCAGCGAGT 
CCAATGTCCA 
CCGACATCTT 
AAGGCAAGTT 
TCAAGGTCCT 
AGATCGAAAT 
ATGACCGGAG 
AGCTGCAGAA 
CAGATGCTCT 
ATCTGCTCTT 
CGCCCTCCCT 
TTGAGGGGCG 
AGCTGCTGGT 
TCGTCAAGGT 
CCAAACTGCT 
CTTGGGTCCG 
GATGGTCCCT 
AGAAGGGATC 
CTGAAGCTTT 



21 
i 

CTTGGACCCC 
CCCCTGGCCC 
CCTCCGGAAA 
GCCCACAGCT 
AACGCGGCAC 
TGGAAACGTG 
CTTCAAGTCC 
CCAGGCCCAC 
GAGGATCTAC 
GAGCTGCACA 
AATGTACTGC 
AGGGCTCAAG 
GAGGAGGAAG 
CATGCACAAT 
GGGGAACCCA 
GGACCTAAAG 
CAGGCATAAC 
GGCCAACTCT 
GTCATTCACT 
CCTAACTGTT 
TTGT 



31 
I 

AGCTCTCCTC 
TACGGCCGAC 
GAGCCTGTCA 
GCCCCTGGCC 
TTCACAATTG 
TACTTGGCTC 
CAGATAGAGA 
CTGCACCATC 
TTGATTCTAG 
TTTGACGAGC 
CATGGGAAGA 
GGAGAGCTGA 
ACAATGTGTG 
GAGAAGGTGG 
CCCTTTGAGA 
TTCCCCGCTT 
CCCTCGGAAC 
CGGAGGGTGC 
CGGGTGCGTG 
CCCTTATCTG 



41 
I 

CCCCTTTCTC 
AGACGGCTCC 
CCCCATCTGC 
AGAAGGTGAT 
ATGACTTTGA 
GGGAGAAGAA 
AGGAGGGCGT 
CCAACATCCT 
AGTATGCCCC 
AGCGAACAGC 
AGGTGATTCA 
AGATTGCTGA 
GCACCCTGGA 
ATCTGTGGTG 
GTGCATCACA 
CTGTGCCCAC 
GGCTGCCCCT 
TGCCTCCCTC 
TGTTTGTATG 
TTTTCTACCT 



51 
I 

TCTAAGGATG 
ATCTGGCCTG 
ACTTGTCCTC 
GGAGAATAGC 
GATTGGGCGT 
AAGCCATTTC 
GGAGCATCAG 
GCGTCTCTAC 
CCGCGGGGAG 
CACGATCATG 
CAGAGACATA 
CTTCGGCTGG 
CTACCTGCCC 
CATTGGAGTG 
CAACGAGACC 
GGGAGCCCAG 
GGCCCAGGTC 
TGCCCTTCAA 
TCTGTGTATG 
CCTCCTTTGT 



Seq ID NO: 93 Protein Sequence 
Protein Accession #: NP_004208 



MAQKENSYPW 
SSGTPDIIiTR 
QLRREIEIQA 
MEELADALMY 
PPEMIEGRMH 
QDLISKLLRH 



11 
I 

PYGRQTAPSG 
HFTIDDFEIG 
HLHHPNILRL 
CHGKKVIHRD 
NEKVDLWCIG 
NPSERLPLAQ 



21 
1 

LSTLPQRVLR 
RPLGKGKFGN 
YNYFYDRRRI 
IKPENLLLGL 
VLCYELLVGN 
VSAHPWVRAN 



31 
I 

KEPVTPSALV 
VYLAREKXSH 
YLILEYAPRG 
KGELKIADFG 
PPFESASHNE 
SRRVLPPSAL 



Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession &> NM_007019.1 

Coding sequence: 41.-580 



GGCACGAGCG 
CCCAGCCGCC 
CCGGGGTCCG 
TAAAGGGATT 
TGGAGCAGCT 
TGGCTACCCT 
GGACACCCAG 
TGTCAGGACC 



11 

I 

AGTTCCTGTC 
ACTAGCGTCG 
GTGGGCAAAA 
TCTGCCTTCC 
GGAACAGTAT 
TACAATGCGC 
GGTAACATAT 
ATTCTGCTCT 



21 
I 

TCTCTGCCAA 
CCGCCGCCCG 
GGCTACAGCA 
CTGAATCAGA 
ATGAAGACCT 
CCACAGTGAA 
GCCTGGACAT 
CCATCCAGAG 



31 
I 

CGCCGCCCGG 
TAAAGGAGCT 
GGAGCTGATG 
CAACCTTTTC 
GAGGTATAAG 
GTTCCTCACG 
CCTGAAGGAA 
CCTTCTAGGA 



41 
I 

ATGGCTTCCC 
GAGCCGAGCG 
ACCCTCATGA 
AAATGGGTAG 
CTCTCGCTAG 
CCCTGCTATC 
AAGTGGTCTG 
GAACCCAACA 



60 

120 

180 

240 

300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



41 51 

I I 

LMSRSNVQPT AAPGQKVMEN 
FIVALKVLFK SQIEKEGVEH 
ELYKELQKSC TFDEQRTATI 
WSVHAPSL.RR KTMCGTLDYL 
TYRRIVKVDL KFPASVPTGA 
QSVA 



60 

120 

180 

240 

300 



51 

1 

AAAACCGCGA 
GGGGCGCCGC 
TGTCTGGCGA 
GGACCATCCA 
AGTTCCCCAG 
ACCCCAACGT 
CCCTGTATGA 
TTGATAGTCC 



60 

120 

180 

240 

300 

360 

420 

480 



265 
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CTTGAACACA CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 540 

AGAAACCTAC TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 600 

CTTGTGTCGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 660 

GACTCTTTAT CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA TTAAGCCTCG 720 

GTTGAGCCCT TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 780 



Seq ID NO: 95 Protein Sequence 
Protein Accession fti NPJ)08950.1 

1 ll 21 31 41 51 

I ] I I i I 

MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQELM TLMMSGDKGI SAFPESDNLF 

KWVGTIHGAA GTVYEDLRYK LSLEPPSGYP YNAPTVKFLT PCYHPNVDTQ GNICLDILKE 

KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELWKNPT AFKKYLQETY SKQVTSQEP 



60 

120 

179 



Seq ID NO: 96 DNA sequence 
Nucleic Acid Accession tt: AK055663 
Coding sequence: 38.. 1423 



11 



AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
. GATTTTTGTT 



I 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 
I 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 
1 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 
I 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



51 

I 

ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTCCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



Seq ID NO: 97 Protein Sequence 
Protein Accession #: BAB709B0.1 



MGTIHLPRKP 
TAYTYl/TIFD 
ESAERFLEQP 
RSLCGIIPGL 
GTMYPMSVYS 
VRIRRDANEQ 
MPLLKGTDDL 
SSMLNQGLGV 



11 

I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG 
SSIPLPRMNP 
GKVLLQTTPP 
MVLAHVTNRL 
NPVTSTPAKP 
PGIGATQGLR 



21 
] 

EFRLVAADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 

I 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF 
ALCITYMIilE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 

I 

NLICTGFLLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



51 

WCSSTNSIA1* 60 

AQLGALFILK 120 

WLQEHVADLS 180 

SAIAIALMTP 240 

GFGSLAGSVH 300 

LNFSDHHVIP 360 

FGLNHGHTPY 420 



266 
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Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession ftt CAT cluster 

1 11 21 31 41 51 

ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60 

GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 

CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240 

AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300 

TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 360 

CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420 

AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480 
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 

Seq ID NO i 99 DNA sequence 

Nucleic Acid Accession ft: FGENESH predicted 

Coding sequence : 1 . .531 

1 11 21 31 41 51 

ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60 

GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 

CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240 

AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300 

TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 360 

CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420 

AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480 
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 

Seq ID NO: 100 Protein Sequence 
Protein Accession ftt FGENESH predicted 

1 11 21 31 41 51 

Lltnasgip Lkypnvini GKFFINFQIQ TDIREDIREK rpfkciecgk afnkfstltt 

HKKIHTREKP YKCEECGKAF NWSSYLTIHK RIHTGEKLYK CEECGKAFNW SSYLTAYKIT 

HTREKPYKCE ecgkafkyss nltthkiiht gehlykceec gkafnhaacf fvilki 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 101 DNA sequence 
Nucleic Acid Accession ft: NM_032589.1 
Coding sequence: 147-422 

1 11 i 1 i 1 r i 1 

ggcacgaggt cttgttatgc aaataggctt cccacttggc aggggccgtc ttgtccactc 
gtttctgtaa acatgggtgg caaaaagaga agatggagct gccatttaga ^tgcctaa 

TCCCAGCTTC ATCTTGCTGA GCAAAAATGA AGGAGCCTGG ACCCAACTTT OTACTOTO* 
GAAAGGGTCT TCATTCATTC AAGATGGCAT TTGTTAAGCA CCTACTGCTG GAGTGCAdTG 
GTTCAATCAC GGATCACTGC AGCCTCCACC TCCCAGTTCA AGAAATTCTC ATGTCTCAGC 
CTCCTGAGCA GCTAGGATTA CAGACAAACC TTGGAAATCA AGAAAGTTCT GGAATGATGA 
AGCTGTTCAT GCCAAGACCG AAAGTGCTGG CCCAGTATGA GTCCATTCAG TTCATGCCGT 
GACAATTTTC TTGGAACTCC TTTTTATTGT TAGTTCTCAC TTGTTTCCAT ATTTAGTGAA 
TGTACATTTA ATTGCAAAGC TGTCATTAAT AAAAATTCTT ATAGTACCTC ACTGCAC 

Seq ID NO: 102 Protein sequence i 
Protein Accession ft: NP_115978.1 

1 11 21 |1 I 1 ^ 

Ikepgpnfvt Irkglhsfkm Ifvkhlllec SGSITDHCSL HLPVQEILMS QPPEQLGLQT 60 

NLGNQESSGM MKLFMPRPKV LAQYESIQFM P 

Seq ID NO: 103 DNA sequence 

Nucleic Acid Accession ft: NMJJ14501.1 

Coding sequence: 60.. 737 

1 U 21 » « f l 

GGCGGACCGA AGAACGCAGG AAGGGGGCCG GGGGGACCCG CCCCCGGCCG GCCGCftGCCA SO 
TGAACTCCAA CGTGGAGAAC CTACCCCCGC ACATCATCCG CCTGGTGTAC AAGGAGGTGA 120 
CGACACTGAC CGCAGACCCA CCCGATGGCA TCAAGGTCTT TCCCAACGAG GAGWCCTCA 180 
cSacctcca GGTCACCATC GAGOGOCCTG AGGGGACCCC ATATGCTGGA GGTCTGTTCC 240 
GCATGAAACT CCTGCTGGGG AAGQACTTCC CTGCCTCCCC ACCCAAGGGC TACTTCCTGA 300 
CCAAGATCTT CCACCCGAAC GTGGGCGCCA ATGGCGAGAT CTGCGTCAAC GTCCTCAAGA 360 
GGGACTGGAC GGCTGAGCTG GGCATCCGAC ACGTACTGCT GACCATCAAG TGCCTGCTGA 420 

267 



WO 03/003906 



TCCACCCTAA CCCCGAGTCT GCACTCAACG AGGAGGCGGG CCGCCTGCTC TTGGAGAACT 480 

ACGAGGAGTA TGCGQCTCGQ GCCCGTCTGC TCACAGAGAT CCACGOGOGC GCCGGCGGGC 540 

CCAGCGGCAG GGCCGAAGCC GGTCGGGCCC TGGCCAGTGG CACTGAAGCT TCCTCCACCG 600 

ACCCTGGGGC CCCAGGGGGC CCGGGAGGGG CTGAGGGTCC CATGGCCAAG AAGCATGCTG 660 

GCGAGCGCGA TAAGAAGCTG GCGGCCAAGA AAAAGACGGA CAAGAAGCGG GCGCTGCGGG 720 

CGCTGCGGCG GCTGTAGTGG GCTCTCTTCC TCCTTCCACC GTGACCCCAA CCTCTCCTGT 780 

CCCCTCCCTC CAACTCTGTC TCTAAGTTAT TTAAATTATG GCTGGGGTCG GGGAGGGTAC 840 
AGGGGGCACT GGGACCTGGA TTTGTTTTTC TAAATAAAGT TGGAAAAGCA 

Seq ID NO: 104 Protein Sequence 
Protein Accession ft: ^^055316.1 

1 11 21 31 41 51 

MNSNVENLPP HIIRLVYKEV TTLTADPPDG IKVFPNEEDL TDliQVTIEGP EGTPYAGGLP 60 

RMKLLLGKDF PASPPKGYFL TKIFHPNVGA NGEICVNVLK RDWTAELGIR HVU/TIKCLL 120 

IHPNPESALN EEAGRLLLEN YEEYAARARL LTEIHGGAGG PSGRAEAGRA LASGTEASST 180 
DPGAPGGPGG AEGPMAKKHA GERDKKLAAK KKTDKKRALR ALRRL 



60 
120 



Seq ID NO: 105 DNA sequence 
Nucleic Acid Accession Jh NM_005101 
Coding sequences 76.. 573 

1 11 21 31 41 51 

CGGCTGAGAG GCAGCGAACT CATCTTTGCC AGTACAGGAG CTTGTGCCGT GGGCCACAGC 60 

CCACAGCCCA CAGCCATGGG CTGGGACCTG ACGGTGAAGA TGCTGGCGGG CAACGAATTC 120 

CAGGTGTCCC TGAGCAGCTC CATGTCGGTG TCAGAGCTGA AGGCGCAGAT CACCCAGAAG 180 

ATTGGCGTGC ACGCCTTCCA GCAGCGTCTG GCTGTCCACC CGAGCGGTGT GGCGCTGCAG 240 

GACAGGGTCC CCCTTGCCAG CCAGGGCCTG GGCCCTGGCA GCACGGTCCT GCTGGTGGTG 300 

GACAAATGCG ACGAACCTCT GAGCATCCTG GTGAGGAATA ACAAGGGCCG CAGCAGCACC 360 

TACGAGGTCC GGCTGACGCA GACCGTGGCC CACCTGAAGC AGCAAGTGAG CGGGCTGGAG 420 

GGTGTGCAGG ACGACCTGTT CTGGCTGACC TTCGAGGGGA AGCCCCTGGA GGACCAGCTC 480 

CCGCTGGGGG AGTACGGCCT CAAGCCCCTG AGCACCGTGT TCATGAATCT GGGOTGCGG 540 

GGAGGCGGCA CAGAGCCTGG CGGGCGGAGC TAAGGGCCTC CACCAGCATC CGAGCAGGAT 600 
CAAGGGCCGG AAATAAAGGC TGTTGTAAGA GAAT 

Seq ID NO: 106 Protein Sequence 
Protein Accession 8: NP_005092 

1 11 21 31 41 51 

MGWDLTVKML AGNEFQVSLS SSMSVSELKA QITQKIGVHA FQQRLAVHPS fVALQDRVPL 
ASQGLGPGST VLLWDKCDE PLSILVRNNK GRSSTYEVRL TQTVAHLKQQ VSGLEGVQDD 
LFWLTFEGKP LEDOLPLGEY GLKPLSTVFM NLRLRGGGTE PGGRS 

Seq ID NO: 107 DNA sequence 
Nucleic Acid Accession #: NM_00601B 
Coding sequence: 1..1161 

1 11 21 31 41 * l 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

^K^AGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA «™TCTTT 12 
GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 
TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT GATCTGCCTG 

££rS* TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300 

ScTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540 

ESSSS ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 

SSSS TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

SSatc ccaagatcaa gagagccatc accttcatca tggtggtggc gatcgtcttt 
Stcatctgct tccttcccag cgtggttgtg cggatccgca tcttctggct cctgcacact 
?2SSc agaattgtga agtgtaccgc tcggtggacc tggcgttctt tatcactctc 
SScct acatgaacag catgctggac cccgtggtgt actacttctc cagcccatcc 
£SS£aact tcttctccac tttgatcaac cgcigcctcc agaggaagat gacaggtgag 
cSgataata accgcagcac gagcgtcgag ctcacagggg accccaacaa aaccagaggc 

£££££ CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 
ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 
CAGTTGGGAT GTTGCATCGA G 

Seq ID KOs 108 Protein Sequence 
Protein Accession If s NP_006009.1 

x IX 21 31 41 51 

268 



120 
180 
240 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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| | I I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW I FCFHLKSWK 60 

SSRIFLFNLA VADFLLIICL PFVMDYTVRR SDWKFGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVQLTV HLXjKKKLLIQ NGPANVCISF 180 

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 240 

VICFLPSWV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NOt 109 dna sequence 

Nucleic Acid Accession #: NM_005030.2 

Coding sequence: 63.. 1874 

1 11 21 31 41 51 

GGCACGAGGG AGCGGTGCGG AGGCTCTGCT CGGATCGAGG TCTGCAGCGC AGCTTCGGGA 60 

GCATGAGTGC TGCAGTGACT GCAGGGAAGC TGGCACGGGC ACCGGCCGAC CCTGGGAAAG 120 

CCGGGGTCCC CGGAGTTGCA GCTCCCGGAG CTCCGGCGGC GGCTCCACCG GC GAAAG AGA 1B0 

TCCCGGAGGT CCTAGTGGAC CCACGCAGCC GGCGGCGCTA TGTGCGGGGC CGCTTTTTGG 240 

GCAAGGGCGG CTTTGCCAAG TGCTTCGAGA TCTCGGACGC GGACACCAAG GAGGTGTTCG 300 

CGGGCAAGAT TGTGCCTAAG TCTCTGCTGC TCAAGCCGCA CCAGAGGGAG AAGATGTCCA 360 

TGGAAATATC CATTCACCGC AGCCTCGCCC ACCAGCACGT CGTAGGATTC CACGGCTTTT 420 

TCGAGGACAA CGACTTCGTG TTCGTGGTGT TGGAGCTCTG CCGCCGGAGG TCTCTCCTGG 480 

AGCTGCACAA GAGGAGGAAA GCCCTGACTG AGCCTGAGGC CCGATACTAC CTACGGCAAA 540 

TTGTGCTTGG CTGCCAGTAC CTGCACCGAA ACCGAGTTAT TCATCGAGAC CTCAAGCTGG 600 

GCAACCTTTT CCTGAATGAA GATCTGGAGG TGAAAATAGG GGATTTTGGA CTGGCAACCA 660 

AAGTCGAATA TGACGGGGAG AGGAAGAAGA CCCTGTGTGG GACTCCTAAT TACATAGCTC 720 

CCGAGGTGCT GAGCAAGAAA GGGCACAGTT TCGAGGTGGA TGTGTGGTCC ATTGGGTGTA 780 

TCATGTATAC CTTGTTAGTG GGCAAACCAC CTTTTGAGAC TTCTTGCCTA AAAGAGACCT 840 

ACCTCCGGAT CAAGAAGAAT GAATACAGTA TTCCCAAGCA CATCAACCCC GTGGCCGCCT 900 

CCCTCATCCA GAAGATGCTT CAGACAGATC CCACTGCCCG CCCAACCATT AACGAGCTGC 960 

TTAATGACGA GTTCTTTACT TCTGGCTATA TCCCTGCCCG TCTCCCCATC ACCTGCCTGA 1020 

CCATTCCACC AAGGTTTTCG ATTGCTCCCA GCAGCCTGGA CCCCAGCAAC CGGAAGCCCC 1080 

TCACAGTCCT CAATAAAGGC TTGGAGAACC CCCTGCCTGA GCGTCCCCGG GAAAAAGAAG 1140 

AACCAGTGGT TCGAGAGACA GGTGAGGTGG TCGACTGCCA CCTCAGTGAC ATGCTGCAGC 1200 

AGCTGCACAG TGTCAATGCC TCCAAGCCCT CGGAGCGTGG GCTGGTCAGG CAAGAGGAGG 1260 

CTGAGGATCC TGCCTGCATC CCCATCTTCT GGGTCAGCAA GTGGGTGGAC TATTCGGACA 1320 

AGTACGGCCT TGGGTATCAG CTCTGTGATA ACAGCGTGGG GGTGCTCTTC AATGACTCAA 1380 
CACGCCTCAT CCTCTACAAT GATGGTGACA GCCTGCAGTA CATAGAGCGT GACGGCACTG 1440 
AGTCCTACCT CACCGTGAGT TCCCATCCCA ACTCCTTGAT GAAGAAGATC ACCCTCCTTA 1500 
AATATTTCCG CAATTACATG AGCGAGCACT TGCTGAAGGC AGGTGCCAAC ATCACGCCGC 1560 
GCGAAGGTGA TGAGCTCGCC CGGCTGCCCT ACCTACGGAC CTGGTTCCGC ACCCGCAGCG 1620 
CCATCATCCT GCACCTCAGC AACGGCAGCG TGCAGATCAA CTTCTTCCAG GATCACACCA 1680 
AGCTCATCTT GTGCCCACTG ATGGCAGCCG TGACCTACAT CGACGAGAAG CGGGACTTCC 1740 
GCaStACCG CCTGAGTCTC CTGGAGGAGT ACGGCTGCTG CAAGGAGCTG GCCAGCCGGC 1800 
TCC^SaC^ cScACTATG GTGGACAAGC TGCTGAGCTC ACGCTCGGCC AG^AACCGTC 1860 
TCAAGGCCTC CTAATAGCTG CCCTCCCCTC CGGACTGGTG CCCTCCTCAC TCC^CCTGC 1920 
ATCTGGGGCC CATACTGGTT GGCTCCCGCG GTGCCATGTC TGCAGTGTGC CCCCCAGCCC 1980 
Tvc-TCGcrGG GCAGAGCTGC ATCATCCTTG CAGGTGGGGG TTGCTGTGTA AGTTATTTTT 2040 
SaStg^ £££££ ?£acagcct tgtccccctc CCCCTCAACC CCACCATATG 2100 
StACAG AATATTTCTA TTCAATTCGG AACTGTCCTT TCCTTGGCTT TATGCACATT 2160 
AAACAGATGT GAATATTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



Seq ID NO: 110 Protein Sequence 
Protein Accession Jh NPJ)0S021.2 



1 11 i 1 " i 1 I 1 

MSAAVTAGKL ARAPADPGKA GVPGVAAPGA PAAAPPAKEI PEVLVDPRSR RRYVRGRFLG 
S^VFA GKIVPKSLIi KPHQREXMSM EISIHRSLAH Q™GFF 
EDNDPVFWl* ELCRRRSLLE LHKRRKALTE PEARYYLRQI VLGCQYLHRN WlfflMM 
NLFLNEDLEV KIGDFGLATK VEYDGERKKT LCGTPNYIAP EVLSKKGHSF EVDVMS1GCI 

,, t n imTkrmrn&C T.Tn WMT .ATTIP TARPTINELL 



60 
120 
180 
240 
300 
360 
420 
480 
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600 



NLFLNEDLEV KIODTOIjATR. vaiw«^« ^nTWTvrotf 

MYTLLVGKPP FETSCLKETY LRIKKNEYSI PKHINPVAAS LIQKMLQTDP TARPTINELL 

SdIf^sgyi parlpitclt ipprfsiaps sldpsnrkpl tvlnkglenp ^erprekee 

PWRETGEW DCHLSDMLQQ LHSVNASKPS ERGLVRQEEA EDPACIPIFW VSKWVDYSDK 

ygIgSn ^vlfnds? rlilyndgds lqyierdgte syltvsshpn slmkkitllk 

YFRNYMSEHL LKAGANITPR EGDELARLPY LRTWFRTRSA IILHLSNGSV QINFFQDHTK 

ISSS S^fr Sruslleey gcckelasrl ryartmvdkl lssrsasnrl 

KAS 

Seq ID NOt 111 DNA sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94.. 378 

1 11 21 31 41 SI 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 
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CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 3 60 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 112 Protein Sequence 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNP 



Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession ft: NM_001110.1 

Coding sequence: 470-. 2716 



GAATTCGAGG 
CGCGCGCATG 
GCGGCTCCGC 
TGAGTTTCGA 
GGAGGAAGGA 
CCGGCCCCTG 
CCGGCTTCCC 
CTGCTGTTAA 
GAGAGTGTTA 
TTTAAATAAA 
AAAACACCAG 
CCATGCCCAT 
TGAATTTAAA 
TGGACATATT 
TGAAGGATTC 
TAAAGACCGA 
CCATAAATAC 
ATACCAGATG 
TGGTCCAGAA 
TTATATTCAG 
CCAGATATCC 
. AATCCGTAAC 
GGACCCTACA 
TTCTGAGCAG 
TGATGGCGTA 
TGAAAAAAGT 
TGTTCAGAAC 
AGTTGGACAT 
TAAGAATTTG 
GGACAAACTT 
TGAGAAGAAG 
GGTAGAACAA 
CTTCGATGCA 
TCCAAGTCAA 
TCGGGATGAT 
AGCATCTGAC 
TGGGCAATGT 
TTCTGATGGC 



11 
I 

ATCCGGGTAC 
CGCGCGCCCC 
GTGCCAGTTG 
GGGAGGGGGG 
AACGAACGAG 
AAGTGGAGCG 
GTGGAGGCTC 
CCCGTGAGGA 
ATTCTGCTCC 
TATATCAGAC 
CGTGCCAAAA 
GGAAGACATT 
GTAGAAACAT 
TATGGTGAAG 
ATCCAGACTC 
ACTCTGCCAT 
GGTCCTCAGG 
ACTGGTGTAG 
CTTCTGAGGA 
ACTGATCATT 
AGTCATGTTA 
ATCAGTTTCA 
AATCCTTTCC 
AATCATGATG 
CTTGGTCTGG 
AAACTCTATT 
TATGGGTCTC 
AACTTTGGAT 
GGTCAAAAAG 
AACAACAATA 
AGAAACAACT 
GGTGAAGAAT 
AATCAACCAG 
GGTCCTTGTT 
TCAGACTGTG 
CCTAAACCAA 
GCAGGTTCTA 
AAAGATGATA 



21 



CATGGGCGGC 
TGAAGCGCCT 
GGTGCCCGCG 
GAGAGAAGAG 
GGGGAGGGAG 
AGAGGGAGGT 
CGGACCAAGC 
GGCGGCGGCG 
TCTCCTGGGC 
ATTATGAAGG 
GAGCAGTCTC 
TCAACCTACG 
CAAATAAAGT 
AAGGAAGTTT 
GTGGTGGCAC 
TTCACTCTGT 
GGGGCTGTGC 
AGGAAGTAAC 
AAAAACGTAC 
TGTTCTTTAA 
AAGCGATTGA 
TGGTGAAACG 
GTTTCCCAAA 
ACTACTGTTT 
CTTGGGTTGG 
CAGATGGTAA 
ATGTACCTCC 
CCCCACATGA 
AAAATGGCAA 
AATTCTCACT 
GTTTTGTTGA 
GTGATTGTGG 
AGGGAAGAAA 
GTACAGCACA 
CAAGGGAAGG 
ACTTCACAGA 
TCTGTGAGAA 
AAGAATTATG 



31 
1 



41 



51 



GGCAGGCCTA 
GGGGGACGGG 
CGTCACGTGG 
GGAACGAGCA 
GTCCCTGTTT 
GCTTCGCCGT 
CCCTTCAGCT 
GCGGCAGCGG 
GGCGGGGATG 
ATTATCTTAC 
ACATGAAGAC 
AATGAAGAGG 
ACTTGATTAT 
TAGCCATGGG 
ATTTTATGTT 
CATTTATCAT 
AGATCAXTCA 
ACAGATACCT 
AACTTCAGCT 
ATATTACGGA 
TACAATTTAC 
CATAAGAATC 
TATTGGTGTG 
GGCCTATGTC 
AGCACCTTCA 
GAAGAAGTCC 
CAAAGTCTCT 
TTCTGGAACA 
TTACATCATG 
CTGTAGTATT 
ATCTGGCCAA 
CTATAGTGAC 
ATGCAAACTG 
GTGTGCATTC 
AATATGTAAT 
CTGTAATAGG 
ATATGGCTTA 
CCATGTATGC 



GCAGCACGGG 
TATGGGCGGG 
TGAGGAAGGA 
AGGGAAGGAA 
TGGAGGAGCT 
TTCTCCTGCC 
TCTCCCTCCG 
CAGCGGAAGA 
GGAGGTCAGT 
AATGTGGATT 
CAATTTTTAC 
GACACTTCCC 
GATACCTCTC 
TCTGTTATTG 
GAGCCAGCAG 
GAAGATGATA 
GTATTTGAAA 
CAAGAAGAAC 
GAAAAAAATA 
ACACGAGAAG 
CAGACCACAG 
AATACAACTG 
GAGAAGTTTC 
TTCACAGACC 
GGAAGCTCTG 
TTAAACACTG 
CACATTACTT 
GAGTGCACAC 
TATGCAAGAG 
AGAAATATAA 
CCTATTTGTG 
CAGTGTAAAG 
AAACCTGGGA 
AAGTCAAAGT 
GGCTTCACAG 
CATACACAAG 
GAGGAGTGTA 
TGTATGAAGA 



AACCGTCCCC 
AGGTAGGGGC 
GGCGGAGGTC 
AGCGGGGAAA 
AGGAGCGTTG 
AGGGGAGGTC 
GATCGATGTG 
TGGTGTTGCT 
ATGGGAATCC 
CATTACACCA 
GTCTAGATTT 
TTTTCAGTGA 
ATATTTACAC 
ATGGAAGATT 
AGAGATATAT 
TTAACTATCC 
GAATGAGGAA 
ATGCTGCTAA 
CTTGTCAGCT 
CTGTGATTGC 
ACTTCTCCGG 
CTGATGAGAA 
TGGAATTGAA 
GAGATTTTGA 
GAGGAATATG 
GAATTATTAC 
TTGCTCACGA 
CAGGAGAATC 
CAACATCTGG 
GCCAAGTTCT 
GAAATGGAAT 
ATGAATGCTG 
AACAGTGCAG 
CTGAGAAGTG 
CTCTCTGCCC 
TGTGCATTAA 
CGTGTGCCAG 
AAATGGACCC 
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960 

1020 

1080 

1140 
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1440 

1500 
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ATCAACTTGT GCCAGTACAG GGTCTGTGCA GTGGAGTAGG CACTTCAGTG GTCGAACCAT 2340 

CACCCTGCAA CCTGGATCCC CTTGCAACGA TTTTAGAGGT TACTGTGATG TTTTCATGCG 2400 

GTGCAGATTA GTAGATGCTQ ATGGTCCTCT AGCTAGGCTT AAAAAAGCAA TTTTTAGTCC 2460 

AGAGCTCTAT GAAAACATTG CTGAATGGAT TGTGGCTCAT TGGTGGGCAG TATTACTTAT 2520 

GGGAATTGCT CTGATCATGC TAATGGCTGG ATTTATTAAG ATATGCAGTG TTCATACTCC 2580 

AAGTAGTAAT CCAAAGTTGC CTCCTCCTAA ACCACTTCCA GGCACTTTAA AGAGQAGGAG 2640 

ACCTCCACAG CCCATTCAGC AACCCCAGCG TCAGCGGCCC CGAGAGAGTT ATCAAATGGG 2700 

ACACATGAGA CGCTAACTGC AGCTTTTGCC TTGGTTCTTC CTAGTGCCTA CAATGGGAAA 2760 

ACTTCACTCC AAAGAGAAAC CTATTAAGTC ATCATCTCCA AACTAAACCC TCACAAGTAA 2820 

CAGTTGAAGA AAAAATGGCA AGAGATCATA TCCTCAGACC AGGTGGAATT ACTTAAATTT 2880 

TAAAGCCTGA AAATTCCAAT TTGGGGGTGG GAGGTGGAAA AGGAACCCAA TTTTCTTATG 2940 

AACAGATATT TTTAACTTAA TGGCACAAAG TCTTAGAATA TTATTATGTG CCCCGTGTTC 3000 

CCTGTTCTTC GTTGCTGCAT TTTCTTCACT TGCAGGCAAA CTTGGCTCTC AATAAACTTT 3060 

TACCACAAAT TGAAATAAAT ATATTTTTTT CAACTGCCAA TCAAGGCTAG GAGGCTCGAC 3120 

CACCTCAACA TTGGAGACAT CACTTGCCAA TGTACATACC TTGTTATATG CAGACATGTA 3180 

TTTCTTACGT ACACTGTACT TCTGTGTGCA ATTGTAAACA GAAATTGCAA TATGGATGTT 3240 

TCTTTGTATT ATAAAATTTT TCCGCTCTTA ATTAAAAATT ACTGTTTAAT TGACATACTC 3300 

AGGATAACAG AGAATGGTGG TATTCAGTGG TCCAGGATTC TGTAATGCTT TACACAGGCA 3360 
GTTTTGAAAT GAAAATCAAT TTACCCCATG GTACCCGGAT CCTCGAATTC 

Seq ID NO: 114 Protein Sequence 
Protein Accession ft: NP_001101.1 



MVLLRVLILL LSWAAGMGGQ YGNPLNKYIR HYEGLSYNVD SLHQKHQRAK RAVSHEDQFL 60 

RLDFHAHGRH FNLRMKRDTS LFSDEFKVET SNKVLDYDTS HIYTGHIYGE EGSFSHGSVI 120 

DGRFEGFIQT RGGTFYVEPA ERYIKDRTLP FHSVIYHEDD INYPHKYGPQ GGCADHSVFE 160 

RMRKYQMTGV EEVTQIPQEE HAANGPELLR KKRTTSAEKN TCQLYIQTDH LFFKYYGTRE 240 

AVIAQISSHV KAIDTIYQTT DFSGIRNISF MVKRIRINTT ADEKDPTNPF RFPNIGVEKF 3 00 

LELNSEQNHD DYCLAYVFTD RDFDDGVLGL AWVGAPSGSS GGICEKSKLY SDGKKKSLNT 360 

GIITVQNYGS HVPPKVSHIT FAHEVGHNFG SPHDSGTECT PGESKNLGQK ENGNYIMYAR 420 

ATSGDKLNNN KFSLCSIRNI SQVLEKKRNN CFVESGQPIC GNGMVEQGEE CDCGYSDQCK 480 

DECCFDANQP EGRKCKLKPG KQCSPSQGPC CTAQCAFKSK SEKCRDDSDC AREGICNGFT 540 

ALCPASDPKP NFTDCNRHTQ VCINGQCAGS ICEKYGLEEC TCASSDGKDD KELCKVCCMK 600 

KMDPSTCAST GSVQWSRHFS GRTITLOPGS PCNDFRGYCD VFMRCRLVDA DGPLARLKKA 660 

IFSPELYENI AEWIVAHWWA VLLMGIALIM LMAGFIKICS VHTPSSNPKL PPPKPLPGTL 720 
KRRRPPQPIQ QPQRQRPRES YQMGHMRR 

Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession ft: NM_000577.2 

Coding sequence: 41„.520 



1 
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1440 
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AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 116 Protein Sequence 
Protein Accession th NPJ) 005 68.1 
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1 11 21 31 41 51 

I | 1 I I I 

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQM/AG YLQGPNVNLE EKIDWPIEP 60 
HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSEKRK QDKRFAFIRS DSGPTTSFBS 120 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 



Seq ID NO: 117 DNA sequence 

Nucleic Acid Accession 8: CAT cluster 

1 11 21 31 |l 51 

GCACTTTAGC TCAGTGATGG ATTTTAAGAA CGATTACACC TGTCGCCTGT GGTCTGACTC 60 

CAGGCACTCG CGTCAGGTAC TTCTGCTCCA GGATAGCTTT ATGAATTGCT CTGACAGCAT 120 

CATCAATGGT TCCTTTCGTG CGCTTGGCTT TATTCATGAG GCTCAGGTCG GGGAAAGACT 180 

GATGGTCCAC TGTGACAGCA AGACAGGTAA TGCAAATACG GATTTCATCT GGGTGGGTCC 240 

AGATAACAGA CTGCTAGAGC CGGATAAAGA GATGGAAAAC TTTTACGTGT TTCACAATGG 300 

AAGTCTGGTT ATAGAAAGCC CTCGTTTTGA GGATGCTGGA GTGTATTCTT GTATCGCAAT 360 

GAATAAGCAA CGCCTGTTAA ATGAAACTGT GGACGTCACA ATAAATGTGA GCAATTTCAC 420 

TGTAAGCAGA TCCCATGCTC ATGAGGCATT TAACACAGCT TTTACCACTC TTGCTGCTTG 480 

CGTGGCCAGT ATCGTTTTGG TACTTTTGTA CCTCTATCTG ACTCCATGCC CCTGCAAGTG 540 

TAAAACCAAG AGACAGAAAA ATATGCTACA CCAAAGCAAT GCCCATTCAT CGATTCTCAG 600 

TCCTGGCCCC GCTAGTGATG CCTCCGCTGA TGAACGGAAG GCAGGTGCAG GTAAAAGAGT 660 

GGTGTTTTTG GAACCCCTGA AGGATACTGC AGCAGGGCAG AACGGGAAAG TCAGGCTCTT 720 

TCCCAGCGAG GCAGTGATAG CTGAGGGCAT CCTAAAGTCC ACGAGGGGGA AATCTGACTC 780 
AGATTCAGTC AATTCAGTGT TTTCTGACAC ACCTTTTGTG GCGTCCACTT AA 



Seq ID NO: 11 B dna sequence 

Nucleic Acid Accession ft: AB032989.1 

Coding sequence: 1..1316 



60 
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1 11 21 31 41 51 

CCCATTCCTT GCCCAGTTAC ACAGCACTAC TGGACCTCAG TCACAACAAC CTGAGCCGCC 
TGCGGGCCGA GTGGACCCCC ACGCGCCTGA CCCAACTGCA CTCCCTGCTG CTGAGCCACA 
ACCACCTGAA CTTCATCTCC TCTGAGGCCT TTTCCCCGGT ACCCAACCTG CGCTACCTGG 
ACCTCTCCTC CAACCAGCTG CGTACACTGG ATGAGTTCCT GTTCAGTGAC CTGCAAGTAC 

TGGAGGTGCT GCTGCTCTAC AATAACCACA TCATGGCGGT GGACCGGTGC GCCTTCGATG 300 

ACATGGCCCA GCTGCAGAAA CTCTACTTGA GCCAGAACCA GATCTCTCGC TTCCCTCTGG 3 60 
AACTGGTCAA GGAAGGAGCC AAGCTACCCA AACTAACGCT CCTGGATCTC TCTTCTAACA 
AGCTGAAGAA CTTGCCATTG CCTGACCTGC AGAAGCTGCC GGCCTGGATC AAGAATGGGC 
TGTACCTACA TAACAACCCC CTGAACTGCG ACTGTGAGCT CTACCAGCTG TTTTCACACT 
GGCAGTATCG GCAGCTGAGC TCCGTGATGG ACTTTCAAGA GGATCTGTAC TGCATGAACT 
CCAAGAAGCT GCACAATGTC TTCAACCTGA GTTTCCTCAA CTGTGGCGAG TACAAGGAGC 
GTGCCTGGGA GGCCCACCTG GGTGACACCT TGATCATCAA GTGTGACACC AAGCAGCAAG 
GGATGACCAA GGTGTGGGTG ACACCAAGTA ATGAACGGGT GCTAGATGAG GTGACCAATG 
GCACAGTGAG TGTGTCTAAG GATGGCAGTC TTCTTTTCCA GCAGGTGCAG GTCGAGGACG 

GTGGTGTGTA TACCTGCTAT GCCATGGGAG AGACTTTCAA TGAGACACTG TCTCTGGAAT 900 

TGAAAGTGCA CAATTTCACC TTGCACGGAC ACCATGACAC CCTCAACACA GCCTATACCA 960 

CCCTAGTGGG CTGTATCCTT AGTGTGGTCC TGGTCCTCAT ATACCTATAC CTCACCCCTT 1020 

SSSK GTGCCGGGGT GTAGAGAAGC CTTCCAGCCA TCAAGGAGAC AGCCTCAGCT 1080 

CTTCCATGCT TAGTACCACA CCCAACCATC ATCCTATGGC TGGTGGGGAC AAAGATGATG 1140 

GTTTTGACCG GCGGGTGGCT TTCCTGGAAC CTGCTGGACC TGGGCAGGGT CAAAACGGCA 1200 

AGCTCAAGCC AGGCAACACC CTGCCAGTGC CTGAGGCCAC AGGCAAGGGC CAACGGAGGA 1260 

StCC AGAATCAGTC AGCTCGGTCT TCTCTGATAC GCCCATTGTG ^GAGCAG 1320 

GATGGGTTGG TGGGGAGATT CTGCCCCAGG AGAGGTAATG CACCCCTGAA GOATATGAGG 1380 

GGATGGAAGA GAGGGCTGGC TGCCCAAGGG AATGGATTCC TCCTGACCTC ^GOMTTG 1440 

GTCCCAGGTA CAACAGAGAG CAAGACCCCA AACAGGGCGT GGCTGCCACG ATTTTCAAAT 1500 

GGGGGTATAT TAATCCTCAG GCAAATGCTA CACCCGTACC CAAAGCCCTG CCAATTCTCA 1S60 

GtcSgI GGAAGAGAAG CATGTCTGAG TTGGATGGGA ATGAGGAAGG AGTGAGGGGA 162 

AGAACAGATT CCCTAAACTT TCATGAGGTC ATCCCTAGCT CCTTAAGAGA ^CATTTA 1680 

GAAAAAAAAA TTTTTTCTAT CTCTGCCCAC CCACACCTGT GATGTTGTGT GTGTTGGGGG 1740 

AGOTCCACA GGAGCCACAC TGGGGAAAAG CTGTAGTATC TTCTTTGTTT AGCAAGTCAA 1800 

ACTTCACAGC TGGGGAAGTT GAAAACCTTT GGAAAATGAG TCAGGAAAAG G^GAGACCT 1860 

CAATCGGTAA CAATTCCCCA AAGCTGTTGT AAGACTTTGC ATTAAAGCCT "CTTTTTCC 1920 

TGATGTGCCC TGGGTGGATG GATCTTTGCA GGAGCCTGGC CTGCTGCCTT TTAGTTACAA 1980 

CAGCAATGTG GGATGTGGTA TCTGTGCCTT GTCCTGGGAC CAATGGCACA GAGGGTTATA 2040 

GCATGAACTG AAGGGTCCAT GTCACAGCAT AGGTGCCAGC ACGAAACTTG TAGATGGGGG 2100 
GTGAATTCCA GATAGGCTAA TTCCCTGGTT GGTGAAACCC TTGGTTAGTA TTTGATTCCC 2160 
ScTGTG TGTCAGGCCA GAGTGGGGGC TGCTGAAGGC AGAGCTTTTA TGGATGCCTG 2220 
CCACAGACGT GGTACATGGG CCCACTTCAC GATAGAAGGA TGAAGGTAAA TCAGCCTCTA 2280 
TTATGTAGCA TCCAGCTGGA CCTAGCTGAT TCTCAATGTA TGCCATCTCT TCATCCAGTC 2340 
TCCTTATCAA GCTCAGGTCG GATTCTGAAT TTTCCCTAAT ATGCTGCTCT GTAAAGGGAG 2400 
S^SaAGA AAGCCTGGCC CCAGCTGGCT TAGAGTAGGC TTGGCGATAG TGAAAATCCA 2460 
GTATATATCT CCTCTGTGTG GACAAGGTTG CCTTGCTCCT CTCAGAGATA CCACCTCTGC 2520 
TCCAAAGTCA AAGCATGGGG TTTTGACATC AGAAGGGTTG GAATTGGGTG GTCTCTACCC 2SB0 
Sg^gS ATTTGGAGTG CAGCTTCAAG TTCTGCCCTC TCCACAGATT CCAGGCAATC 2640 
CTGAAGCAGA GATAATCAGT AGTACTAAAC TTTCAGAAGG GGAGTTCTGC CACCCCATTC 2700 

272 
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PCT/US02/21338 



CTCCATCGAO AGGTCTTTGC AQTTGTGQTT GTGAGQCCAG AGGAACTTAA GAAGAATGAG 27S0 

CTCAACATTC CATTCCATCC AAGGTTGCAG AATACTTTGA GTATGGAGAT GACCGATCCC 2820 

TTTAACCTTT CCCTTTGACT ATACTGCATG CCCTGTGAGT GGCTGCTCTG AGCTTTGGCG 2880 

GGGGATGAGC AGGGTGAAGG TGAAAATGTT CTCTGATGTT GGTTTTTCTT AGTGTCTCTT 2940 

5 CTCTCCCACG TTTCTAGTCC ATTGGAAGTT CTCCAGATCC TCTAGATTTC TGACCTCTTT 3000 

TTACATTCTA AGAGGTGTCA TTTACCTGTG GGTGAGGGGC TTGGAGGGCC TGACTATTAA 3060 

GCTTCCTATT ATGATTGGGG AAGGGGACTC AACATCTTCC CCTTGTCTGG TCCTGGGAAC 3120 

TCAAGACTGC TGTCCATCCT TTATTAGTAA CCTAGTTTTG TTGAGAGAGA GGTGCAGCAT 3180 

TTCCCTCAAG TACCTGCATG CTCATAGGTA TGCATGCTTA TAGGTATGCA TGCACACCAG 3240 

10 CACCTTGACT TCCACCAGGA CAGTGAATCT GTAGTCCCCA TTAATGATAC TCTCAGAATG 3300 

GTCTCCACCA ATGTGCACAA CTTCATGCAC ACAATTGTAT TCCTCTTTAG GAGAGACAGT 3360 

GACCATGCCA GGCTCTTCAA ATACACACTC CTGTATTTTC ATTTCTGAAC TCCCTGAAGC 3420 

TTCACCCACA TCTCCATGGC ACTGAATCAC TTTTCTGCCC AGGCTGGAGT TAGAAGAACC 3480 

ATGGGTTGAA GGCCTTGGGA GAAGAGGTGG TAAGGAGCTA CCTGGTTCTT TGGACTCCTC 3540 

15 AGAAACCTCC CGTGAAGACT ATATATATGA GCAATATATG AGCAATTAAG ATAGAACTAG 3600 

GTCATTGAAG GACTCAGAAG GAAACTGGCA AGGGGCTAAG TAGCTGTGAG CTGCTCTTCC 3660 

TGTTAACTGT CCTCAGGAGA GAAAAGTTGT CTTGATTTAT TCCAGAGACT CCTGAGAATT 3720 

TTGTTACTGG AGCTATAGGA ACTAGGACTC CCTCCTCAGA CTCATCAGGA AATCTG GGAA 3780 

AATCTCATCT CCATGCTGCA GTCTCCCCGC TTGGCACCAT CTAGGGCTCT GAAGATCTTT 384 0 

20 AAAGTGAGCG TTAAATGAAG CAGGCCTTCT GGGGCATCGT GATGGAGCTC CTGAGGACTG 3900 

GGAGGGCCGT TTTCTTCATA ATTCCTACCC AGTTTAGGGA ATGAGAAAAG GCCCTGTGGC 3960 

TCTTAAGTCT CCTGACTATA TCCACTAGTT TGGCCTTGAG TTTGAGACAC CTGGTAGATG 4020 

TACCTGGCTG CCCTACCAAG AGATTTGATT TGTTTCTTCC CGTGGTTTCA GGTGATAGAA 4080 

CCCAAAGGTA GCTTTTTAAT TGGAGAATTG AATAATCAGC ATATGTAAGC GCACTAGAAC 4140 

25 CCTGTGTTGA AAACTGCCAG GTGTGGGTAT AAGAAAAGGC CGAGAGATCA CCACCTCCTC 4200 

TTACCCTACC CCACACCCAT AAACCAGACA TGTCTCCCAG GAAGCAGGTG TCCCTGGACA 4260 

CAGAGGATGA CAGGGCTCTA CAATCTGTGT AATTATTATT ATTATTATTA TTTTGTATTT 4320 

ATGGGGCCCA AGAAAGGGGC CAGGAGAGGG TACACCCCAG CTGGGGAGAG CAAAGCAGAT 4380 

GGATCCAGTT TTCTGTGTGT TCTTACCTCT GTACTTCCTC GTAGCTCTGC TGACAAAGCA 4440 

30 AGCAGGCCTC CCGTGTCCAA GACCCCATTC CTCCCACTTG TGTACACCTA GGCTGGCAAA 4500 

TCTTGGAGCC TCTGGGCTCT GAAAACTAGA CAATGATCAT TAAACCTGGC TTGAGTCTCT 4560 
GTTCTGGC 

Seq ID NO: 119 Protein Sequence 
35 Protein Accession #; BAA86477.1 

1 11 21 31 41 51 

I I I I I I 

HSLPSYTALL DLSHNNLSRL RAEWTPTRLT QLHSLLLSHN HLNPISSEAF SPVPNLRYLD 60 

40 LSSNQLRTLD EPLFSDLQVL EVLLLYNNHI MAVDRCAFDD MAQLQKLYLS QNQISRFPLE 120 

LVKEGAKLPK LTLLDLSSNK LKNLPLPDLQ KLPAWIKNGL YLHNNPLNCD CELYQLFSHW 180 

QYRQLSSVMD FQEDLYCMNS KKLHNVFNLS FLNCGEYKER AWEAHLGDTL IIKCDTKQQG 240 

MTKVWVTPSN ERVLDEVTNG TVSVSKDGSL LFQQVQVEDG GVYTCYAMGE TFNETtiSVEL 300 

KVHNFTLHGH HDTLNTAYTT LVGCILSWL VLIYLYLTPC RCWCRGVEKP SSHQGDSLSS 3 60 

45 smlsttpnhd pmaggdkddg fdrrvaflep agpgqgqngk lkpgntlpvp eatgkgqrrm 420 
sdpesvssvf sdtpiw 

Seq ID NO: 120 DNA sequence 
Nucleic Acid Accession fl: NM_00507l 
50 Coding sequence: 9.. 1703 

1 11 21 31 41 51 

GATAGACCAT gagcagccat ggcaacagcc tgttccttcg GGAGAGCGGC CAGCGGCTGG 60 

55 GCCGGGTGGG CTGGCTGCAG CGGCTGCAGG AAAGCCTGCA GCAGAGAGCA CTGCGCACGC 120 

GCCTGCGCCT GCAGACCATG ACCCTCGAGC ACGTGCTGCG CTTCCTGCGC CGAAACGCCT 180 

TCATTCTGCT GACGGTCAGC GCCGTGGTCA TTGGGGTCAG CCTGGCCTTT GCCCTGCGCC 240 

CATATCAGCT CACCTACCGC CAGATCAAGT ACTTCTCTTT TCCTGGAGAG CTTCTGATGA 300 

GGATGCTGCA GATGCTGGTG TTACCTCTCA TTGTCTCCAG CCTGGTCACA GGTATGGCAT 3 60 

60 CCCTGGACAA CAAGGCCACG GGGCGGATGG GGATGCGGGC AGCTGTGTAC TACATGGTGA 420 

CCACCATCAT CGCGGTCTTC ATCGGCATCC TCATGGTCAC CATCATCCAT CCCGGGAAGG 480 

GCTCCAAGGA GGGGCTGCAC CGGGAGGGCC GGATCGAGAC CATCCCCACA GCTGATGCCT 540 

TCATGGACCT GATCAGAAAT ATGTTTCCAC CAAACCTTGT GGAGGCCTGC TTCAAACAGT 600 

TCAAGACGCA GTACAGCACG AGGGTGGTAA CCAGGACCAT GGTGAGGACA GAGAACGGGT 660 

65 CTGAGCCGGG TGCCTCCATG CCTCCTCCAT TCTCAGTGGA GAACGGAACC AGCTTCCTGG 720 

AAAATGTCAC TCGGGCCTTG GGTACCCTGC AGGAGATGCT GAGCTTTGAG GAGACTGTAC 780 

CCGTGCCTGG CTCCGCCAAT GGCATCAACG CCCTGGGCCT CGTGGTCTTC TCTGTGGCCT 840 

TTGGGCTGGT CATTGGTGGC ATGAAACACA AGGGCAGAGT CCTCAGGGAC TTCTTCGACA 900 

GCCTCAATGA GGCTATTATG AGGCTGGTGG GCATCATTAT CTGGTATGCA CCTGTGGGCA 960 

70 TCCTGTTCCT GATTGCTGGG AAGATTCTGG AGATGGAAGA CATGGCCGTC CTGGGGGGTC 1020 

AGCTGGGCAT GTACACCCTG ACCGTCATCG TGGGCCTGTT CCTCCATGCC GGCATTGTCC 1080 

TTCCCCTCAT CTACTTCCTC GTCACTCACC GGAACCCCTT CCCCTTCATT GGGGGCATGC 1140 

TACAAGCCCT CATCACCGCT ATGGGCACGT CTTCCAGCTC GGCAACGCTG CCCATCACCT 1200 

TCCGCTGCCT GGAGGAGGGC CTGGGTGTGG ACCGCCGCAT CACCAGGTTC GTCCTGCCCG 1260 

75 TGGGCGCCAC GGTCAACATG GATGGCACTG CCCTCTACGA GGCCCTGGCT GCCATCTTCA 1320 

TTGCTCAAGT TAACAACTAC GAGCTCAACC TGGGTCAGAT CACAACCATC AGCATCACGG 1380 

CCACAGCAGC CAGTGTTGGG GCTGCTGGCA TCCCCCAGGC GGGTCTGGTC ACCATGGTCA 1440 

TTGTGCTTAC GTCGGTCGGC TTGCCCACGG AAGACATCAC GCTCATCATC GCCGTGGACT 1500 

273 



WO 03/003906 



PCT/US02/21338 
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GGTTCCTTGA CCGGCTTCGC ACAATGACCA AOGTACTGGG GGACTCAATT GGAGCGGCCG 
TCATCGAGCA CTTGTCTCAG CGGGAGCTGG AGCTTCAGGA AGCTGAGCTT ACCCTCCCCA 
GCCTGGGGAA ACCCTACAAG TCCCTCATGG CACAGGAGAA GGGGGCATCC CGGGGACGGG 
GAGGCAAOGA GAGTGCTATG TGAGGGGCCT CCAGCTCTG 

Seq ZD NO« 121 Protein Sequence 
Protein Accession fh NP_005062 



1560 
1620 
1680 



MSSHGNSLFL 
LTVSAWIGV 
NKATGRMGMR 
LIRNMFPPNL 
TRALGTLQEM 
EAIMRLVGII 
IYFLVTHRNP 
TVNMDGTALY 
T SVG LPT ED I 
KPYKSLMAQE 



11 

I 

RESGQRLGRV 
SLAFALRPYQ 
AAVYYMVTTI 
VEACFKQFKT 
LSFEETVPVP 
IWYAPVGILF 
FPFIGGMLQA 
EALAAIFIAQ 
TLIIAVDWFL 
KGASRGRGGN 



21 

! 

GWLQRLQESL 
LTYRQIKYFS 
IAVFIGILMV 
QYSTRWTRT 
GSANGINALG 
LIAGKILEME 
LITAMGTSSS 
VNNYELNLGQ 
DRLRTMTNVL 
ESAM 



31 
I 

QQRALRTRLR 
FPGELLMRML 
TIIHPGKGSK 
MVRTENGSEP 
LWFSVAFGL 
DMAVLGGQLG 
SATLPITFRC 
ITTISITATA 
GDSIGAAVIE 



41 



I 

LQTMTIiEHVL 
QMLVLPLIVS 
EGLHREGRIE 
GASMPPPFSV 
VIGGMKHKGR 
MYTLTVIVGL 
LEEGLGVDRR 
ASVGAAGIPQ 
HLSQRELELQ 



51 
I 

RFLRRNAFIL 
SLVTGMASLD 
TIPTADAFMD 
ENGTSFLENV 
VLRDFFDSLN 
FLHAGIVLPL 
ITRFVLPVGA 
AGLVTMVIVL 
EAELTLPSLG 



Seq ID NO i 122 DNA sequence 

Nucleic Acid Accession Jh NM_03l966.l 

Coding sequence: 178.. 1479 



1 
I 

ACGAACAGGC 
TCTCGGCGTG 
CGGGCCTCCG 
GCGCTCCGAG 
GCAGGCGCAA 
AGAACAGCTC 
ATGAAGAAGG 
CCTCTTGAAA 
GAACCTGAGC 
GTTGATACTG 
CTGTGTCAGG 
GCTGATCCAA 
GAGGAAGAGC 
AGAGCCATCC 
ACCATGTACA 
AAGATGCTGC 
TACCCTCCAG 
ATCAGACAGA 
CCTTTGCACT 
TTGGCCAAAT 
TCTCAAATTG 
ACACCAACTC 
CACCTGGCTA 
AACAAGTATG 
CTAGTTCAAG 
ACTATATTTA 
AAAAAAAAAA 



11 

I 

CAATAAGGAG 
CTGCGGCGGA 
GTGTTCTGCT 
TCACCAGGAA 
AGCGCGTTCC 
TTGGGGACAT 
AAGCAAAACC 
AGGTACCTAT 
CAGAACCTGA 
CCTCTCCAAG 
CTTTCTCTGA 
ACCTTTGTAG 
AAGCAGTCAG 
TAATTGACTG 
TGACTGTCTC 
AGCTGGTTGG 
AAATTGGTGA 
TGGAAATGAA 
TCCTTCGGAG 
ACCTGATGGA 
CAGCAGGAGC 
TACAACATTA 
AGAATGTAGT 
CCACATCGAA 
ATTTAGCCAA 
CAAATAAAAT 
AAAAAAAA 



21 
I 

GGAGCAGTGC 
ACGGCTGTTG 
TCTCCCCGCT 
CTCGAAAATT 
TACGGCCCCT 
TGGTAACAAA 
TTCAGCTACT 
GCTGGTGCCA 
GCCTGTTAAA 
CCCAATGGAA 
TGTAATTCTT 
TGAATATGTG 
ACCAAAATAC 
GCTAGTACAG 
CATTATTGAT 
TGTCACTGCC 
CTTTGCTTTT 
GATTCTAAGA 
AGCATCTAAG 
ACTAACTATG 
TTTTTGCTTA 
CCTGTCATAT 
CATGGTAAAT 
GCATGCTAAG 
GGCTGTGGCA 
TGGCACCATG 



31 
I 

GGGGTTTAAA 
GTTTCTGCTG 
GAGCTGCTGC 
AATGCTGAAA 
QCTGCAACCT 
GTCAGTGAAC 
GGAAAAGTCA 
GTGCCAGTGT 
GAAGAAAAAC 
ACATCTGGAT 
GCAGTAAATG 
AAAGATATTT 
CTACTGGGTC 
GTTCAAATGA 
CGGTTCATGC 
ATGTTTATTG 
GTGACTGACA 
GCTTTAAACT 
ATTGGAGAGG 
TTGGACTATG 
GCACTGAAAA 
ACTGAAGAAT 
CAAGGACTTA 
ATCAGCACTC 
AAGGTGTAAC 
TGCCATCTGT 



41 
I 

TCTGAGGCTA 
GTTGTAGGTC 
CTGGTGAAGA 
ATAAGGCGAA 
CCAAGCCCGG 
AACTGCAGGC 
TTGATAAAAA 
CTGAGCCAGT 
TTTCGCCTGA 
GTGCCCCTGC 
ATGTGGATGC 
ATGCTTATCT 
GGGAAGTCAC 
AATTCAGGTT 
AGAATAATTG 
CAAGCAAATA 
ACACTTATAC 
TTGGTCTGGG 
TTGATGTCGA 
ACATGGTGCA 
TTCTGGATAA 
CTCTTCTTCC 
CAAAGCACAT 
TACCACAGCT 
TTGTAAACTT 
AAAAAAAAAA 



51 
1 

GGCTGGCTCT 
CTTGGCTGGT 
GGAAGCCATG 
GATCAACATG 
ACTGAGGCCA 
CAAAATGCCT 
ACTACCAAAA 
GCCAGAGCCA 
GCCTATTTTG 
AGAAGAAGAC 
AGAAGATGGA 
GAGACAACTT 
TGGAAACATG 
GTTGCAGGAG 
TGTGCCCAAG 
TGAAGAAATG 
TAAGCACCAA 
TCGGCCTCTA 
GCAACATACT 
CTTTCCTCCT 
TGGTGAATGG 
AGTTATGCAG 
GACTGTCAAG 
GAATTCTGCA 
GAGTTGGAGT 
AAAAAAAAAA 



Seq ID NO: 123 Protein Sequence 
Protein Accession 8: NP_114172.1 



MALRVTRNSK 
PMKKEAKPSA 
LVDTASPSPM 
LEEEQAVRPK 
KKMLQLVGVT 
LPLHFLRRAS 
WTPTLQHYLS 
ALVQDLAKAV 



11 
I 

INAENKAKIN 
TGKVIDKKLP 
ETSGCAPAEE 
YLIiGREVTGN 
AMPIASKYEE 
KIGBVDVEQH 
YTEESLLPVM 
AKV 



21 
I 

MAGAKRVPTA 
KPLEKVPMLV 
DLOQAFSDVI 
MRAILIDWLV 
MYPPEIGDFA 
TLAKYLMELT 
QHLAKNWMV 



31 
I 

PAATSKPGIiR 
PVPVSEPVPE 
LAVNDVDAED 
QVQMKFRLLQ 
FVTDNTYTKH 
KLDYDMVHFP 
NQGLTKHMTV 



41 

I 

PRTALGDIGN 
PEPEPEPEPV 
GADPNLCSEY 
ETMYMTVSII 
QIRQMEMKIL 
PSQIAAGAFC 
KNKYATSKHA 



60 

120 

180 

240 

300 

360 

420 

480 

540 



51 

I 

KVSEQLQAKM 
KEEKLSPEPI 
VKDIYAYLRQ 
DRFMQNNCVP 
RALNFGLGRP 
LALKILDNGE 
KISTLPQLNS 



Seq ID NO i 124 DNA sequence 

Nucleic Acid Accession fh Eos sequence 

Coding sequence: 13-1424 



51 



1 11 21 ?* 

[III i 1 

TAGAAGTTTA CAATGAAGTT TCTTCTAATA CTGCTCCTGC AGGCCACTGC TTCTGGAGCT 
CTTCCCCTGA ACAGCTCTAC AAGCCTGGAA AAAAATAATG TGCTATTTGG TGAAAGATAC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



60 

120 

180 

240 

300 

3 60 

420 



60 
120 
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TTAGAAAAAT TTTATGGCCT TGAGATAAAC AAACTTCCAG TGACAAAAAT GAAATATAGT 180 

GGAAACTTAA TGAAGGAAAA AATCCAAGAA ATGCAGCACT TCTTGGGTCT GAAAGTGACC 240 

GGGCAACTGG ACACATCTAC CCTGGAGATG ATGCACGCAC CTCGATGTGG AGTCCCCGAT 300 

GTCCATCATT TCAGGGAAAT GCCAGGGGGG CCCGTATGGA GGAAACATTA TATCACCTAC 360 

AGAATCAATA ATTACACACC TGACATGAAC CGTGAGGATG TTGACTACGC AATCCGGAAA 420 

GCTTTCCAAG TATGGAGTAA TGTTACCCCC TTGAAATTCA GCAAGATTAA CACAGGCATG 480 

GCTGACATTT TGGTGGTTTT TGCCCGTGGA GCTCATGGAG ACTTCCATGC TTTTGATGGC S40 

AAAGGTGGAA TCCTAGCCCA TGCTTTTGGA CCTGGATCTG GCATTGGAGG GGATGCACAT 600 

TTCGATGAGG ACGAATTCTG GACTACACAT TCAGGAGGCA CAAACTTGTT CCTCACTGCT 660 

GTTCACGAGA TTGGCCATTC CTTAGGTCTT GGCCATTCTA GTGATCCAAA GGCCGTAATG 720 

TTCCCCACCT ACAAATATGT TGACATCAAC ACATTTCGCC TCTCTGCTGA TGACATACGT 780 

GGCATTCAGT CCCTGTATGG AGACCCAAAA GAGAACCAAC GCTTGCCAAA TCCTGACAAT 840 

TCAGAACCAG CTCTCTGTGA CCCCAATTTG AGTTTTGATG CTGTCACTAC CGTGGGAAAT 900 

AAGAT CTTTT TCTTCAAAGA CAGGTTCTTC TGGCTGAAGG TTTCTGAGAG ACCAAAGACC 960 

AGTGTTAATT TAATTTCTTC CTTATGGCCA ACCTTGCCAT CTGGCATTGA AGCTGCTTAT 1020 

GAAATTGAAG CCAGAAATCA AGTTTTTCTT TTTAAAGATG ACAAATACTG GTTAATTAGC 1080 

AATTTAAGAC CAGAGCCAAA TTATCCCAAG AGCATACATT CTTTTGGTTT TCCTAACTTT 1140 

GTGAAAAAAA TTGATGCAGC TGTTTTTAAC CCACGTTTTT ATAGGACCTA CTTCTTTGTA 1200 

GATAACCAGT ATTGGAGGTA TGATGAAAGG AGACAGATGA TGGACCCTGG TTATCCCAAA 1260 

CTGATTACCA AGAACTTCCA AGGAATCGGG CCTAAAATTG ATGCAGTCTT CTACTCTAAA 1320 

AACAAATACT ACTATTTCTT CCAAGGATCT AACCAATTTG AATATGACTT CCTACTCCAA 1380 

CGTATCACCA AAACACTGAA AAGCAATAGC TGGTTTGGTT GTTGAAAATG GTGTAATTAA 1440 

TGGTTTTTGT TAGTTCACTT CAGCTTAATA AG TAT TT ATT GCATATTTGC TATGTCCTCA 1500 

GTGTACCACT ACTTAGAGAT ATGTATCATA AAAATAAAAT CTGTAAACCA TAGGTAATGA 1560 

TTATATAAAA TACATAATAT TTTTCAATTT TGAAAACTCT AATTGTCCAT TCTTGCTTGA 1620 

CTCTACTATT AAGTTTGAAA ATAGTTACCT TCAAAGCAAG ATAATTCTAT TTGAAGCATG 1680 

CTCTGTAAGT TGCTTCCTAA CATCCTTGGA CTGAGAAATT ATACTTACTT CTGGCATAAC 1740 
TAAAATTAAG TATATATATT TTGGCTCAAA TAAAATTG 



Seq ID MO: 125 Protein sequence: 
Protein Accession ft» Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSIiEKNNV LPGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 126 DNA sequence 

Nucleic Acid Accession ft: NMJJ00077.2 

Coding sequence t 2 72.. 742 

1 11 21 31 41 51 

) | | I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 360 

GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420 

GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 480 

CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 540 

CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600 

GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660 

GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720 

TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 780 

GATCATCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840 

TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG T CCTGC CTTT TAACGTAGAT 900 

ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960 

AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020 

AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 1080 

GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140 

CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200 
ATTTTCATTC ATTCACTC 



Seq ID NO: 127 Protein Sequence 
Protein Accession ft: NP_000068.1 

1 11 21 31 fl 51 

| | | 1 I I 

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60 

ELLLLBGAEP NCADPATLTR PVHDAAREGF IxDTLWLHRA GARLDVRDAW GRLPVDLAEE 120 



275 



WO 03/003906 



PCT/US02/21338 



LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD 

Seq ID NO: 128 DNA sequence 
Nucleic Acid Accession ft: NM_058196.1 
5 Coding sequence: 104.. 421 

1 11 21 31 41 51 

i I I I I I 

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60 

10 GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120 

CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180 

TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240 

GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300 

GGCTGAGGAG CTGGGCCATC GCGATGTCGC ACGGTACCTG CGCGCGGCTG CGGGGGGCAC 360 

15 CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420 

AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 4 80 

CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540 

AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600 

TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660 

20 CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720 

CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780 

AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840 
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC 

25 Seq ID NO; 129 Protein Sequence 
Protein Accession #: NP_478103.1 

1 11 21 31 41 51 

30 MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLWLHRAG ARLDVRDAWG 60 
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD 

Seq ID NO: 130 DNA sequence 
Nucleic Acid Accession tt: NW_058197.1 
35 Coding sequence: 272.. 646 

1 11 21 31 41 51 

I I I I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

40 TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGOGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360 

45 GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420 

TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480 

CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660 

50 ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720 

TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960 

55 GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020 

CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260 

60 TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1S00 
TTCATTCATT CACTC 



65 



Seq ID NO-. 131 Protein Sequence 
Protein Accession 8: NP_478104.1 



1 11 21 31 41 51 

70 | | | ill 

MEPAAGSSME PAAGSSMBPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SYGRRPIQVG 60 
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGD ALGA WET 120 
KEEE 

75 Seq ID NO: 132 DNA sequence 

Nucleic Acid Accession ft: NM_058195.1 
Coding sequence: 163.. 6B4 



276 



WO 03/003906 



PCT/US02/21338 



1 11 21 31 41 51 

I I I I I I 

CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60 

GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120 

5 GGGOGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180 

GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGCGCT GCTCACCTCT GGTGCCAAAG 240 

GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GCGCAGGTTC 300 

TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCCGCCGC GAGTGAGGGT TTTCGTGGTT 360 

CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG CGCCCGCCGC TGTGGCCCTC 420 

10 GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480 

CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540 

CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600 

GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGO 660 

CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720 

15 CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780 

CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020 

20 TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260 
TTCATTCATT CACTC 



25 



70 



Seq ID NO: 133 Protein Sequence 
Protein Accession 8: KP 478102.1 



1 11 21 31 41 51 

30 | | | 1 | I 

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60 
VRVFWHIPR LTGBWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120 
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 

35 Seq ID NO: 134 DNA sequence 

Nucleic Acid Accession 8: NM_023915 
Coding sequence: 1..1077 

1 11 21 31 41 51 

40 | || | || 

ATGGGGTTCA ACTTGACGCT TGCAAAATTA CCAAATAACG AGCTGCACGG CCAAGAGAGT 60 

CACAATTCAG GCAACAGGAG CGACGGGCCA GGAAAGAACA CCACCCTTCA CAATGAATTT 120 

GACACAATTG TCTTGCCGGT GCTTTATCTC ATTATATTTG TGGCAAGCAT CTTGCTGAAT 180 

GGTTTAGCAG TGTGGATCTT CTTCCACATT AGGAATAAAA CCAGCTTCAT ATTCTATCTC 240 

45 AAAAACATAG TGGTTGCAGA CCTCATAATG ACGCTGACAT TTCCATTTCG AATAG TCCA T 300 

GATGCAGGAT TTGGACCTTG GTACTTCAAG TTTATTCTCT GCAGATACAC TTCAGTTTTG 360 

TTTTATGCAA ACATGTATAC TTCCATCGTG TTCCTTGGGC TGATAAGCAT TGATCGCTAT 420 

CTGAAGGTGG TCAAGCCATT TGGGGACTCT CGGATGTACA GCATAACCTT CACGAAGGTT 480 

TTATCTGTTT GTGTTTGGGT GATCATGGCT GTTTTGTCTT TGCCAAACAT CATCCTAACA 540 

50 AATGGTCAGC CAACAGAGGA CAATATCCAT GACTGCTCAA AACTTAAAAG TCCTTTGGGG 600 

GTCAAATGGC ATACGGCAGT CACCTATGTG AACAGCTGCT TGTTTGTGGC CGTGCTGGTG 660 

ATTCTGATCG GATGTTACAT AGCCATATCC AGGTACATCC ACAAATCCAG CAGGCAATTC 720 

ATAAGTCAGT CAAGCCGAAA GCGAAAACAT AACCAGAGCA TCAGGGTTGT TGTGGCTGTG 780 

TTTTTTACCT GCTTTCTACC ATATCACTTG TGCAGAATTC CTTTTACTTT TAGTCACTTA 840 

55 GACAGGCTTT TAGATGAATC TGCACAAAAA ATCCTATATT ACTGCAAAGA AATTACACTT 900 

TTCTTGTCTG CGTGTAATGT TTGCCTGGAT CCAATAATTT ACTTTTTCAT GTGTAGGTCA 960 

TTTTCAAGAA GGCTGTTCAA AAAATCAAAT ATCAGAACCA GGAGTGAAAG CATCAGATCA 1020 
CTGCAAAGTG TGAGAAGATC GGAAGTTCGC ATATATTATG ATTACACTGA TGTGTAA 

60 Seq ID NO: 135 Protein Sequence 
Protein Accession 8: MP_076404 

1 11 21 31 41 51 

~ I 1 I I I I 

OJ MGPNLTLAKL PNNELHGQES HKSGNRSDGP GKNTTLHNEP DTIVLPVLYL IIFVASILLM 60 

GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPPRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV IIiIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRWVAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
FLSACNVCLD PIIYFFMCRS PSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 



Seq ID NO: 136 DNA sequence 
Nucleic Acid Accession ftt NM_005329.1 
75 Coding sequence: 1..1662 

1 11 21 31 41 51 

I I I I I I 

277 



WO 03/003906 



PCT/US02/21338 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 
TTCAAGGCCC 
GATCCAOCCT 
GTCGGGGGAG 
GTGCGGTACT 
CAGTGTATTA 
GACTGGTACC 
ACCAACCGAG 
ACAGAGACCC 
TACTTCCGGG 
TACGAGTCAG 
TTCTACCGGG 
ATTATCAAGG 
CTCTACTCCC 
ATCAACAAAT 
CTCATTCCTG 
TGCCAGGACC 
GGCTGCTACT 
AAGAAGCCGG 



AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 
TCGGCGATTC 
GCACCATCGA 
ATGTCCAGAT 
GGATGGCCTT 
GTGGGCCCTT 
ATCAGAAGTT 
TCCTGAGCCT 
CCACTAAGTA 
AGTGGCTCTA 
TGGTCACGGG 
GCCGCATCTG 
CCACCTACGC 
TCCTCTATAT 
CTGGCTGGGG 
TGTCCATCTG 
TGTTCAGTGA 
GGGTGGCCCT 
AGCAGTACAG 



AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 
GGTGGACTAC 
GATGCTTCGA 
CCTCAACAAG 
CAACGTGGAG 
GGGCATGTAC 
CCTAGGCAGC 
TGGCTACCGA 
CCTCCGGTGG 
CAACTCTCTG 
TTTCTTCCCC 
GAACATTCTC 
CTGCTTCCTT 
GTCCAGCCTT 
CACCTCTGGC 
GGTGGCAGTT 
GACAGAGCTA 
CCTCATGCTA 
CTTGGCTTTT 



GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCGT 
ATCTCCTTCC 
TACATGCTGG 
TGGCpCAGCA 
ATGGACCGTG 
GGAGGCAAGC 
ATCCAGGTGT 
GTCCTGGAGG 
TACGACTCAT 
CGGGCCTGCC 
CGCAACAGCC 
AAGTGCAGCT 
ACTAAGTATA 
CTCAACCAGC 
TGGTTCCATA 
TTCTTCCTCA 

CGGGGCAATG 
CTGCCGGCCA 
CGAAAAACCA 
CTCCTGGGAG 
GCCTTCCTTG 
TATCTGGCCA 
GCTGAGGTGT 



Seq ID NOt 137 Protein Sequence 
Protein Accession #: NP 005320.1 



MPVQLTTALR 
LFAFLEHRRM 
WDGNRQEDA 
STFSCIMQKW 
VGGDVQILNK 
DWYHQKFLGS 
YFREWLYNSL 
IIKATYACFL 
LIPVSIWVAV 
KKPEQYSLAF 



11 
I 

WGTSLPALA 
RRAGQALKLP 
YMLDIFHEVL 
GGKREVMYTA 
YDSWISPLSS 
KCSPGDDRHL 
WFHKHHLWMT 
RGKAEMIFMS 
LLGGLAYTAY 
AEV 



21 
I 

VLGGILAAYV 
SPRRGSVALC 
GGTEQAGFFV 
FKALGDSVDY 
VRYWMAFNVE 
TNRVLSLGYR 
YESWTGFFP 
LYSLLYMSSL 
CQDLFSETEL 



31 
I 

TGYQFIHTEK 
IAAYQEDPDY 
WRSNFHEAGE 
IQVCDSDTVL 
RACQSYFGCV 
TKYTARSKCL 
FFLIATVIQXj 
LPAKIFAIAT 
AFLVSGAILY 



CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 
GCGACTCTGA 
AGGATCCCCA 
GGATTTCCTT 
AGTCCTACTT 
TCCTCCAGCA 
TCGGGGATGA 
CCGCGCGCTC 
AAACCCGCTG 
AGCACCACCT 
TTGCCACGGT 
TGACGGTGCA 
CAGAGATGAT 
AGATCTTTGC 
TTGTGGTGAA 
GGCTGGCCTA 
TCTCTGGGGC 
TCATCGCCCG 
GA 



41 
I 

HYLSFGLYGA 
LRKCLRSAQR 
GETEASLQEG 
DPACTIEMLR 
QCISGPLGMY 
TETPTKYLRW 
FYRGRIWNIL 
INKSGWGTSG 
GCYWVALLML 



TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 
CACTGTGCTG 
AGTAGGGGGA 
CCTGAGCAGC 
TGGCTGTGTG 
GTTCCTGGAG 
CCGGCACCTC 
CAAGTGCCTC 
GAGCAAGTCT 
CTGGATGACC 
TATACAGCTT 
GCTGGTGGGC 
CTTCATGTCC 
CATTGCTACC 

CTTCATTGGC 
CACAGCTTAT 
TATACTGTAT 
GCGATGTGGG 



51 
I 

ILGLHLLIQS 
ISFPDLKWM 
MDRVRDWRA 
VLEEDPQVGG 
RNSLLQQFIiE 
LNQQTRWSKS 
LFLLTVQLVG 
RKTIWNFIG 
YLAIIARRCG 



Seq ID NO > 138 DNA Sequence 
Nucleic Acid Accession #: NM_001327. 
Coding sequencer 89.. 631 



1 
I 

AGCAGGGGGC 
CTGAGAGCCG 
GACGGGCGAT 
TGGCGGCCCA 
AAGGGCCTCG 
GCTGAATGGA 
CCTCGCCATG 
GGATGCCCCA 
CATACTGACT 
CTGTCTCCAG 
GGCTCAGCCT 
GCCTCCTCCC 
GTTTGTCGCT 



11 
I 

GCTGTGTGTA 
GGCAGAGGCT 
GCTGATGGCC 
GGAGAGGCGG 
GGGCCGGGAG 
TGCTGCAGAT 
CCTTTCGCGA 
CCGCTTCCCG 
ATCCGACTGA 
CAGCTTTCCC 
CCCTCAGGGC 
CTAGGGAATG 
GGAGGAGGAC 



21 
I 

CCGAGAATAC 
CCGGAGCCAT 
CAGGAGGCCC 
GTGCCACGGG 
OAGGCGCCCC 
GCGGGGCCAG 
CACCCATGGA 
TGCCAGGGGT 
CTGCTGCAGA 
TGTTGATGTG 
AGAGGCGCTA 
GTCCCAGCAC 
GGCTTACATG 



31 
I 

GAGAATACCT 
GCAGGCCGAA 
TGGCATTCCT 
CGGCAGAGGT 
GCGGGGTCCG 
GGGGCGGGAG 
AGCAGAGCTG 
GCTTCTGAAG 
CCACCGCCAA 
GATCACGCAG 
AGCCCAGCCT 
GAGTGGCCAG 
TTTGTTTCTG 



41 
I 

CGTGGGCCCT 
GGCCGGGGCA 
GATGGCCCAG 
CCCCGGGGCG 
CATGGCGGCG 
AGCCGCCTGC 
GCCCGCAGGA 
GAGTTCACTG 
CTGCAGCTCT 
TGCTTTCTGC 
GGCGCCCCTT 
TTCATTGTGG 
TAGAAAATAA 



51 
I 

GACCTTCTCT 
CAGGGGGTTC 
GGGGCAATGC 
CAGGGGCAGC 
CGGCTTCAGG 
TTGAGTTCTA 
GCCTGGCCCA 
TGTCCGGCAA 
CCATCAGCTC 
CCGTGTTTTT 
CCTAGGTCAT 
GGGCCTGATT 
AACTGAGCTA 



Seq ID NO: 13 9 Protein Sequence 
Protein Accession 8: NP 001318.1 



1 11 21 31 41 51 

I I I I I I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN aggpgeagat ggrgprgaga ARASGPGGGA 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 
VLL.KEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCPLPVF IAQPPSGQRR 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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1260 

1320 

13B0 
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60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



60 

120 

180 



Seq ID NO: 140 DNA Sequence 

Nucleic Acid Accession 8i Eos sequence 



278 



WO 03/003906 



PCT/US02/21338 



Coding sequence? 53.. 459 

1 11 21 31 41 51 

-II I III 

J CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 

CCGCATGGCO GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 

10 GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 3 60 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 141 Protein Sequence 
Protein Accession ftt Eos sequence 

20 1 11 21 31 41 51 

I 1 1 1 I 1 

MQAEGQQTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
FLPVFLAQAP SGQRR 



15 



25 
30 



70 



Seq ID MO i 142 DNA Sequence 

Nucleic Acid Accession #: NM_130467.2 

Coding sequence; 246.. 638 



1 11 21 31 41 51 

I I I I I I 

AGGCAGAGCT CTGCAAGGAG AGGTTGTGTC TTCGTTCTTT CCGCCATCTT CGTTCTTTCC 60 

AACATCTTCG TTCTTTCTCA CTGACCGAGA CTCAGCCGGT AGGTCTGCAG AGTGGTCTTC 120 

35 CTGGTAATTT AGTTGTGAGT GAATGTGTGG AGGAGCCAGC GGGCTTAGGA CAGGTCCTGT 180 

GGCACAGTCC GTGGCTTTGA GGGAAAAGGG CCTCGCGGTG GTCCTCCGCC TTCCCCCAGG 240 

TCGTGATGCA GGCGCCATGG G CCGG TAATC GTGGCTGGGC TGGAACGAGG GAGGAAGTGA 3 00 

GAGATATGAG TGAGCATGTA ACAAGATCCC AATCCTCAGA AAGAGGAAAT GACCAAGAGT 360 

CTTCCCAGCC AGTTGGACCT GTGATTGTCC AGCAGCCCAC TGAGGAAAAA CGTCAAGAAG 420 

40 AGGAACCACC AACTGATAAT CAGGGTATTG CACCTAGTGG GGAGATCAAA AATGAAGGAG 480 

CACCTGCTGT TCAAGGGACT GATGTGGAAG CTTTTCAACA GGAACTGGCT CTGCTTAAGA 540 

TAGAGGATGC ACCTGGAGAT GGTCCTGATG TCAGGGAGGG GACTCTGCCC ACTTTTGATC 600 

CCACTAAAGT GCTGGAAGCA GGTGAAGGGC AACTATAGGT TTAAACCAAG ACAAATGAAG 660 

ACTGAAACCA AGAATATTGT TCTTATGCTG GAAATTTGAC TGCTAACATT CTCTTAATAA 720 
4 J AGTTTTACAG TTTTC 

Seq ID NOs 143 Protein Sequence 
Protein Accession #: NP_569734 

50 1 11 21 31 41 51 

I I I I I I 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 60 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL 

55 Seq ID NO: 144 DNA Sequence 

Nucleic Acid Accession #: NM_001476.1 
Coding sequence: 82.. 435 

1 11 21 31 41 51 

60 | | | | || 

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 60 
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 120 
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 180 
GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 240 
65 GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 300 
GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 360 
GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 420 
CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480 
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 



Seq ID NO: 145 Protein Sequence 
protein Accession fl: NP_001467.1 



1 U 21 31 41 51 

75 | | I I || 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEBVKTPEE GEKQSQC 



279 



WO 03/003906 



PCT/US02/21338 



Seq ID NO : 146 DNA Sequence 
Nucleic Acid Accession #: NM_005S62 
5 Coding sequence: 90. .3671 

1 11 21 31 41 51 

I I I I I I 

ACAGCGQAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

10 AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

15 CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

20 GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

25 GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

30 AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

35 CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCGGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

40 GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

45 ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 222 0 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

^ CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

50 CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGQACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GOGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

55 AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

60 AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

65 CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 34 80 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

70 AGCAAGAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

nc ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

75 ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 



280 



WO 03/003906 



PCT/US02/21338 



10 



25 



ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4 920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

15 TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5X00 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 147 Protein Sequence 
20 Protein Accession #: NP_005553 

1 11 21 31 41 51 

I I I I 1 I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGP 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

30 DNVTLISARP VSGAPAPWVE QCICPVGYKG QFOQDCASGY KRDSARLGPF GTCIPCNCQG 420 

GGACDPDTGO CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

35 QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQl* TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSPQVEE AKRI KQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

40 QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLK TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

45 Seq ID NO: 148 DNA Sequence 

Nucleic Acid Accession #: NM_021048.2 
Coding sequence: 327.. 1436 

1 11 21 31 41 51 

50 | ! | | || 

GGCACGAGGG AGAAGCGAGG TTCTCGTTCT GAGGGACAGG CTTGAGATCG GCTGAAGAGA 60 

GCGGGCCCAG GCTCTGTGAG GAGGCAAGGG AGGTGAGAAC CTTGCTCTCA GAGGGTGACT 120 

CAAGTCAACA CAGGGAACCC CTCTTTTCTA CAGACACAGT GGGTCGCAGG ATCTGACAAG 180 

AGTCCAGGTT CTCAGGGGAC AGGGAGAGCA AGAGGTCAAG AGCTGTGGGA CACCACAGAG 240 

55 CAGCACTGAA GGAGAAGACC TGCCTGTGGG TCCCCATCGC CCAAGTCCTG CCCACACTCC 300 

CACCTGCTAC CCTGATCAGA GTCATCATGC CTCGAGCTCC AAAGCGTCAG CGCTGCATGC 360 

CTGAAGAAGA TCTTCAATCC CAAAGTGAGA CACAGGGCCT CGAGGGTGCA CAGGCTCCCC 420 

TGGCTGTGGA GGAGGATGCT TCATCATCCA CTTCCACCAG CTCCTCTTTT CCATCCTCTT 4 B0 

TTCCCTCCTC CTCCTCTTCC TCCTCCTCCT CCTGCTATCC TCTAATACCA AGCACCCCAG 540 

60 AGGAGGTTTC TGCTGATGAT GAGACACCAA ATCCTCCCCA GAGTGCTCAG ATAGCCTGCT 600 

CCTCCCCCTC GGTCGTTGCT TCCCTTCCAT TAGATCAATC TGATGAGGGC TCCAGCAGCC 660 

AAAAGGAGGA GAGTCCAAGC ACCCTACAGG TCCTGCCAGA CAGTGAGTCT TTACCCAGAA 720 

GTGAGATAGA TGAAAAGGTG ACTGATTTGG TGCAGTTTCT GCTCTTCAAG TATCAAATGA 780 

AGGAGCCGAT CACAAAGGCA GAAATACTGG AGAGTGTCAT AAAAAATTAT GAAGACCACT 840 • 

65 TCCCTTTGTT GTTTAGTGAA GCCTCCGAGT GCATGCTGCT GGTCTTTGGC ATTGATGTAA 900 

AGGAAGTGGA TCCCACTGGC CACTCCTTTG TCCTTGTCAC CTCCCTGGGC CTCACCTATG 960 

ATGGGATGCT GAGTGATGTC CAGAGCATGC CCAAGACTGG CATTCTCATA CTTATCCTAA 1020 

GCATAATCTT CATAGAGGGC TACTGCACCC CTGAGGAGGT CATCTGGGAA GCACTGAATA 1080 

TGATGGGGCT GTATGATGGG ATGGAGCACC TCATTTATGG GGAGCCCAGG AAGCTGCTCA 1140 

70 CCCAAGATTG GGTGCAGGAA AACTACCTGG AGTACCGGCA GGTGCCTGGC AGTGATCCTG 1200 

CACGGTATGA GTTTCTGTGG GGTCCAAGGG CTCATGCTGA AATTAGGAAG ATGAGTCTCC 1260 

TGAAATTTTT GGCCAAGGTA AATGGGAGTG ATCCAAGATC CTTCCCACTG TGGTATGAGG 1320 

AGGCTTTGAA AGATGAGGAA GAGAGAGCCC AGGACAGAAT TGCCACCACA GATGATACTA 1380 

_ CTGCCATGGC CAGTGCAAGT TCTAGCGCTA CAGGTAGCTT CTCCTACCCT GAATAAAGTA 1440 

75 AGACAGATTC TTCACTGTGT TTTAAAAGGC AAGTCAAATA CCACATGATT TTACTCATAT 1500 
GTGGAATCTA AAAAAAAAAA AAAAAAAA 
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Seq ID NO: 149 Protein Sequence 
Protein Accession ft: NP_066386.1 

1 11 21 31 41 51 

5 j I I ! I t 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SFPSSSSSSS 60 
SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSWASL PLDQSDEGSS SQKEESPSTL 120 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 180 
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240 
10 TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300 
RAHABIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 3 60 
ATGSFSYPE 

Seq ID NO: 150 DNA Sequence 
15 Nucleic Acid Accession ft: NM_003695 
Coding sequence: 12. .398 

1 11 21 31 41 51 

on I 1 I I I I 

Z(J CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60 

CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120 

TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180 

ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 240 

TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300 

25 ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360 

TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540 

CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600 

30 ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660 

GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720 
TAAATGATTT AAACC 

Seq ID NO: 151 Protein Sequence 
35 Protein Accession ft: NP_003686 

1 11 21 31 41 51 

I 1 I I I I 

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 60 
40 KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 120 
AVILAPSL 

Seq ID NO: 152 DNA Sequence 
Nucleic Acid Accession ft: NMJ306398.1 
45 Coding sequence: 19.. 516 

1 11 21 31 41 51 

I I I I I I 

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 60 

50 GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 120 

GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 240 

CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 360 

55 GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC 420 
AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC " 480 

TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTGGC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT GGGTGGGATG GGTAGGATGA 660 

60 AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 720 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



65 



75 



Seq ID NO: 153 Protein Sequence 
Protein Accession ft: NP_006389.1 



1 11 21 31 41 51 

I I I I I I 

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 60 
RSL.SSYGIDK EKTIHLTLKV VKPSDEELPL FLVESGDEAK RHLLQVRRSS SVAQVKAMIE 120 
70 TKTGIIPETQ IVTCNGKRLE DGKMMADYGI RKGNLLFLAS YCIGG 



Seq ID NO: 154 DNA Sequence 
Nucleic Acid Accession ft: BC017490.1 
Coding sequence: 74.-2788 

1 11 21 31 41 51 

I I 1 I I I 

GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 60 
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TOGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 120 

GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180 

TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240 

CC7AGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300 

5 CATGGAAAGG GACTACOGCG CCATCCCAGA GCTGGACGCC TATGAGGCCG AGGGACTGGC 360 

TCTGGATGAT GAGGACGTAG AGGAGCTGAC GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420 

CATGCGGCAG CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 480 

GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540 

GGCCACGGAG GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600 

10 TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 660 

CCACCGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720 

GGAGCGCATC AGCGACATGT GCAAAGAGAA CCGTGAGAGC CTGGTGGTGA ACTATGAGGA 780 

CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGGAGCTGCT 840 

GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACGACCG 900 

15 CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT CTGGTGGAGG AGCTGCGCTC 960 

GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020 

TGGCGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080 

GGGTCCTTTC TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140 

GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200 

20 CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260 

CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320 

CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380 

CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440 

GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500 

25 GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560 

TCTGGCCCTG TTCGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620 

TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680 

TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740 

CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800 

30 GGTTCTGGCT GACCGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860 . 

CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920 

CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCQ 1980 

CTACGACCCC TOGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040 

CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100 

35 CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160 

GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220 

CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC OGAAGCTCAA 2280 

CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340 

GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400 

40 CCACGCGCGC ATCCATCTGC GGGACTATGT ' GATCGAAGAC GACGTCAACA TGGCCATCCG 2460 

CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520 

GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580 

GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640 

CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700 

45 CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760 

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940 

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000 

50 TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCC TTCTTA C 3120 

ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300 

55 GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360 

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420 
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA 

Seq ID NO: 155 Protein Sequence 
60 Protein Accession ft: AAH17490.1 

1 11 21 31 41 51 

I I I I I ! 

MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTB 60 

65 GPLEEEBDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120 

DREAGRGLGR MKRGhLYDSD EEDEERPARK RRQVERATED GEEDEEMIES IENLEDLKGH 180 

SVREWVSMAG PRLEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL WNYEDLAAR 240 

EHVLAYFLPE APAELLQIFD EAALEWLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 300 

HXjNQLIRTSG WTSCTGVLP QLSMVKYNCN KCNFVLGPFC QSQNQEVKPG SCPECQSAGP 360 

70 FEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNR VAVGELTDED VKMITSLSKD QQIGEKIFAS 480 

IAPSIYGHED IKRGLALALF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540 

SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660 

75 CWRDTVDPV QDEMLARFW GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 720 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

RLRDYVIEDD VNMAIRVMLE SFIDTQKFSV MRSMRKTFAR YLSFRRDNNE LLLFILKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSKDLKRKMI 900 
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Seq ID NO i 156 dna Sequence 
Nucleic Acid Accession gs NMJ)02497.1 
5 Coding sequence: 13S..1472 ~ 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA SO 

10 GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 

GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 

CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 

GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 

TGAATTTGCT TCGTGAACTG AAACATCCAA ACATOGTTCG TTACTATGAT CGGATTATTG 360 

15 ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 

GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 

TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 

CCGTATTGCA TOGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 

AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 

ZU TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 

AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 

TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 

TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 

ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 

CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 

GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140 

TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 

CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 

GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 

GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 

TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 

GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 

TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 

ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 

GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 

TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 

CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 

TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 

TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
TAAAAGAATT CTTCAGTTA 
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Seq ID NO; 157 Protein Sequence 
Protein Accession ft: NP_002488.1 



1 11 21 31 41 51 

50 | | | | | | 

MPSRAEDYEV LYTIGTGSYG RCQKIRRKSD GKILVWKELD YGSMTEAEKQ MLVSEVNLLR 60 

ELKHPNIVRY YDRIIDRTNT TLYIVMEYCE GGDLASVITK GTKERQYLDE EFVLRVMTQL 120 

TLALKECHRR SDGGHTVLHR DLKPANVFLD GKQNVKLGDF GLARIUJHDT SFAKTFVGTP 180 

YYMSPEQMNR MSYNEKSDIW SLGCLLYELC ALMPPFTAPS QKELAGKIRE GKFRRIPYRY 240 

SDELNTEI ITR MLNLKDYHRP SVEEILENPL IADLVADEQR RNLERRGRQL GEPEKSQDSS 300 

PVLSELKLKE IQLQERERAL KAREERLEQK EQELCVRERL AEDKLARAEN LLKNYSLLKE 360 

RKPLSLASNP ELLNLPSSVI KKKVHFSGES KENIMRSENS ESQLTSKSKC KDLKKRLHAA 420 
QLRAQALSDI EKNYQLKSRQ ILGMR 



Seq ID NO: 158 DNA Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

TGAGTTTGCC CCCTTACCCC CATCCCAGTG AATATTTGCA ATTCCTAAAG AOGTGTTTTG 60 

ATTGTCACAC CTGGGTGGGG AACATGCTAC TGGCATCTAA TGCATAGAGG GCAGTAATGC 120 

TGCTAAACAT CTTTCAACGC ACAGGACAGA GCCCCACAAA AGAGAATTAT CTAGCCCCAA 180 

ATGTCCATAA CACTGCTGTT GAGAAAACCT ACCGCAGGAT CTTACTGGGC TTCATAGGTA 240 

AGCTTGCCTT TGTTCTGGCT TCTGTAGATA TATAAAATAA AGACACTGCC CAGTCCCTCC 300 

CTCAACGTCC CGAGCCAGGG CTCAAGGCAA TTCCAATAAC AGTAGAATGA ACACTAAATA 360 

TTGATTTCAA AATCTCAGCA ACTAGAAGAA TGACCAACCA TCCTGGTTGG CCTGGGACTG 420 

TCCTAGTTTT AGCATTGAAA GTTTCAGGTT CCAGGAAAGC CCTCAGGCCT GGGCTGCTGG 480 

TCACCCTAGC AGCTGAGGGA CTCTTCAATA CAGAATTAGT CTTTGTGCAC TGGAGATGAA 540 

TATACTTTAA TTTGTAACAT GTGAAAACAT CTATAAACAT CTACTGAAGC CTGTTCTGTC 600 

TGCACCGACA TTTTCATTGA GTACGGATTC TTCCTACCAG ATACAGCTGC TCTACAACTT 660 

TCGAGGGCTG GTATAAAACT AGCTTTTACC TATTTTTAAA AATTACATGA ATAGTAAAAA 720 

CTTGGATTAA CCCAGTATTC GGGTATTTTC AATTTCCTTG GGAGCTTAGA GGACGGACAA 780 

ATAAAAAGAT TATTTCAACA TCAAATATAY GCTATTGTTT ACATATGAAG ATAACCACAT 840 
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ATATGTATAA ATTCACCGTT ACTTTTTAGC AATACTATAA AATCCAACAG AAAAAAATAG 900 
CATTTACTAT 

Seq ID NO: 159 DNA Sequence 
5 Nucleic Acid Accession 8 : Eos sequence 
Coding sequence j 2 6 4.. 782 

1 li 21 31 41 51 

10 CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA CAGCTGAAGC ATGAGGAAAC 60 

TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AGTAAGGACT TCAACTGACC 120 

TTCCTCAGAC TGAGAACTGT TTCCAGTATA TACATCAAGT CACTGAGATC TCCAGCACCC 180 

TGCCGGTGGC ACTACTGAGA GACGAGGTGC CAGGGTGGTT CCTGAAAQTG CCTGAGCCCC 240 

AACTTATCAG CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 300 

15 TGATCGCGGT GGTGTCCCTC TTCCTGCAGG CCTGCTTCCT CACCGCCATC AACTACCTGC 360 

TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 420 

CCAGGCCCAG CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 480 

CAGAGAGAGA CATCCCAATG TCTGATTCCC TTTACAGGCA TGACAGCGAC ACACCCTCAG S40 

ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCACAGAG GATGTGGATT 600 

20 ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCCG CTGGACTATG 660 

AGAACATAAA GGAAATCACA GATTATGTCA ATGTCAATCC AGAAAGACAC AAGCCCAGTT 720 

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAA GTGGCCATGT 780 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 840 

GGAAATGCCA TTTTTCCCCC TTAAACAAGG CATGGGGCTC ACAAGTCTAT GGAGACAGGC 900 

25 CAAAAAGAAT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA CCCATGGACT 960 

CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG CTTGGCTGGG 1020 

AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG 1080 

TAAAGTATGA GAACTATGGA GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA .1140 

AAATTTTAAA AAGGTTGAAT CAGCTGTTGT AGAGTTCTAT TTGGCAATCT CATGGTTAAA 1200 

30 TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 
AAAATAGCAA CCACCACCA 

Seq ID NO: 160 Protein Sequence 
35 Protein Accession #: Eob sequence 

1 11 21 31 41 51 

I I I I I I 

MLTEVMEVWH GLVIAWSLF LQACPLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 60 

40 HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQWFSD 120 

PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWYFVNPA LSEPAEYDQV AM 

Seq ID NO: 161 DNA Sequence 
Nucleic Acid Accession ft: NM_012152 
45 Coding sequence: 43.. 1104 

1 11 21 31 41 51 

I I I i I I 

CTTCTTTAAA TTTCTTTCTA GGATGTTCAC TTCTTCTCCA CAATGAATGA GTGTCACTAT 60 

50 GACAAGCACA TGGACTTTTT TTATAATAGG AGCAACACTG ATACTGTCGA TGACTGGACA 120 

GGAACAAAGC TTGTGATTGT TTTGTGTGTT GGGACGTTTT TCTGCCTGTT TATTTTTTTT 180 

TCTAATTCTC TGGTCATCGC GGCAGTGATC AAAAACAGAA AATTTCATTT CCCCTTCTAC 240 

TACCTGTTGG CTAATTTAGC TGCTGCCGAT TTCTTCGCTG GAATTGCCTA TGTATTCCTG 300 

ATGTTTAACA CAGGCCCAGT TTCAAAAACT TTGACTGTCA ACCGCTGGTT TCTCCGTCAG 360 

55 GGGCTTCTGG ACAGTAGCTT GACTGCTTCC CTCACCAACT TGCTGGTTAT CGCCGTGGAG 420 

AGGCACATGT CAATCATGAG GATGCGGGTC CATAGCAACC TGACCAAAAA GAGGGTGACA 480 

CTGCTCATTT TGCTTGTCTG GGCCATCGCC ATTTTTATGG GGGCGGTCCC CACACTGGGC 540 

TGGAATTGCC TCTGCAACAT CTCTGCCTGC TCTTCCCTGG CCCCCATTTA CAGCAGGAGT 600 

TACCTTGTTT TCTG6ACAGT GTCCAACCTC ATGGCCTTCC TCATCATGGT TGTGGTGTAC 660 

60 CTGCGGATCT ACGTGTACGT CAAGAGGAAA ACCAACGTCT TGTCTCCGCA TACAAGTGGG 720 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA CGGTGATGAC TGTCTTAGGG 780 

GCGTTTGTGG TATGCTGGAC CCCGGGCCTG GTGGTTCTGC TCCTCGACGG CCTGAACTGC 840 

AGGCAGTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 900 

GTCGTGAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAGAAG 960 

65 ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCACA 1020 

GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 1080 

GTCTGCAATA AAAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 1140 
GTCTTAGG 

70 Seq ID NO; 162 Protein Sequence 
Protein Accession 8: NPJ)362B4 

1 11 21 31 41 51 

I I I I I I 

/ J MNECHYDKHM DPFYNRSNTD TVDDWTGTKL VIVLCVGTPF CLFIFFSNSL VIAAVIKNRK 60 

FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNL 120 

LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL OUSACSSLA 180 

PIYSRSYLVF WTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 240 
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VMTVLGAFW CWTPGLWLL LOGLNCRQCG VQHVKRWFLL UVLLNSWNP 1 1 YSYKDEDM 300 
YGTMXKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 

5 Seq ID NO: 163 DNA Sequence 

Nucleic Acid Accession ttt NM_020242 
Coding sequences 72.. 4240 

1 11 21 31 41 51 

10 | | | | | | 

CAGTCGCGCG CGGTGCAGTC GGGAGGTGGA GGCACCGGCT GCATTGTTTT CGGGATCGAG 60 

GGGTGAGGGC GCTATGGCAC CCGGCTGCAA AACTGAGTTA CGCAGCGTGA CAAATGGTCA 120 

GTCTAACCAA CCAAGTAATG AAGGTGATGC CATCAAAGTT TTTGTGCGAA TTCGTCCTCC 180 

TGCAGAAAGA TCTGGGTCAG CTGATGGAGA GCAGAACTTA TGCTTATCTG TGCTGTCCTC 240 

15 CACGAGTCTC CGGCTGCACT CCAACCCTGA GCCCAAGACC TTCACGTTTG ATCATGTTGC 300 

AGATGTGGAT ACCACTCAGG AATCTGTATT TGCAACTGTG GCTAAAAGCA TTGTGGAGTC 360 

TTGCATGAGC GGTTATAATG GTACCATCTT TGCATATGGA CAGACTGGCT CAGGGAAGAC 420 

ATTTACTATG ATGGGACCAT CTGAATCTGA TAATTTTTCT CATAACCTGA GAGGAGTAAT 480 

CCCACGAAGT TTTGAATATT TGTTTTCCTT AATTGATCGT GAAAAAGAAA AGGCTGGAGC 540 

20 TGGAAAGAGT TTCCTTTGTA AGTGTTCCTT TATTGAAATC TACAACGAGC AGATATATGA S0O 

TCTACTGGAC TCTGCATCGG CTGGACTGTA CTTAAGGGAG CATATCAAGA AGGGAGTCTT 660 

TGTTGTTGGT GCGGTGGAGC AGGTGGTAAC CTCAGCTGCT GAAGCCTATC AGGTGCTGTC 720 

. TGGAGGATGG AGGAATAGAC GTGTGGCATC AACATCAATG AACAGAGAAT CGTCTAGGTC 780 

TCATGCCGTC TTTACAATTA CAATAGAGTC AATGGAGAAA AGTAATGAGA TTGTGAATAT 840 

25 ACGGACCTCC CTACTCAACC TGGTGGATTT AGCAGGATCT GAAAGGCAAA AAGATACCCA 900 

TGCAGAAGGG ATGAGATTGA AGGAAGCAGG TAACATAAAT CGATCATTGA GCTGCCTGGG 960 

CCAAGTGATT ACAGCACTTG TCGACGTGGG TAATGGAAAA CAGAGACATG TTTGCTACAG 1020 

AGACTCCAAA CTTACCTTCT TACTACGGGA TTCCCTTGGA GGTAATGCCA AAACAGCCAT 1080 

AATTGCAAAT GTTCATCCTG GATCCAGGTG TTTTGGGGAA ACCCTATCAA CACTTAACTT 1140 

30 TGCTCAAAGA GCCAAGCTGA TTAAAAACAA GGCAGTAGTA AATGAAGACA CCCAAGGAAA 1200 

TGTGAGCCAG CTCCAAGCTG AAGTGAAGAG GCTCAAAGAA CAACTGGCGG AGCTTGCTTC 1260 

AGGACAGACA CCACCAGAAA GCTTCCTGAC CAGAGACAAA AAGAAGACTA ACTATATGGA 1320 

GTATTTCCAG GAAGCAATGT TATTCTTTAA GAAATCTGAA CAGGAAAAGA AGTCTCTGAT 1380 

AGAAAAAGTT ACCCAATTAG AAGACCTCAC CCTCAAAAAG GAAAAATTTA TTCAATCTAA 1440 

35 TAAAATGATT GTGAAATTCC GAGAGGATCA AATAATACGC TTGGAAAAGC TCCACAAGGA 1500 

ATCCCGGGGA GGTTTTCTGC CTGAGGAGCA GGATCGTTTG CTCTCAGAAT TAAGGAATGA 1560 

GATTCAAACT CTGCGAGAAC AAATAGAGCA CCACCCCAGA GTTGCAAAGT ATGCTATGGA 1620 

AAATCATTCC CTCAGGGAGG AGAATAGAAG ACTGAGATTA TTAGAGCCTG TGAAAAGAGC 1680 

TCAAGAAATG GATGCCCAGA CCATTGCAAA ACTAGAAAAA GCTTTCTCTG AAATAAGTGG 1740 

40 CATGGAGAAA AGTGACAAAA ATCAGCAAGG ATTTTCACCT AAAGCTCAGA AAGAGCCATG 1800 

TTTGTTTGCA AACACTGAGA AGTTAAAAGC ACAACTCCTG CAAATTCAGA CAGAGCTGAA 1860 

TAATTCAAAG CAAGAATATG AAGAATTCAA AGAACTTACT AGGAAAAGGC AGCTAGAATT 1920 

GGAATCAGAG CTTCAGTCTT TGCAAAAAGC GAACCTTAAT CTTGAAAACC TTTTGGAAGC 1980 

AACAAAAGCC TGCAAGCGGC AAGAAGTTTC TCAGCTGAAT AAAATTCATG CTGAAACACT 2040 

45 TAAGATTATA ACTACACCAA CCAAGGCCTA CCAACTTCAT TCCCGACCAG TACCAAAATT 2100 

AAGCCCTGAA ATGGGAAGCT TTGGCTCTCT ATACACTCAG AATTCTAGCA TATTAGATAA 2160 

TGATATATTA AATGAGCCAG TTCCTCCTGA GATGAATGAA CAAGCTTTTG AGGCCATTTC 2220 

TGAAGAGCTT AGAACAGTGC AGGAACAAAT GAGTGCTCTT CAAGCCAAAC TGGATGAAGA 2280 

AGAGCATAAA AACCTAAAGC TTCAGCAGCA TGTTGACAAA CTGGAACATC ATTCTACCCA 2340 

50 AATGCAGGAG CTTTTCTCAT CAGAAAGAAT TGATTGGACC AAACAGCAGG AAGAGCTTCT 2400 

CTCACAGTTG AATGTCCTTG AAAAGCAGCT TCAAGAGACT CAAACTAAAA ATGACTTTTT 2460 

GAAAAGTGAG GTACATGACC TGCGAGTAGT CCTTCATTCT GCTGACAAGG AGCTTTCTTC 2520 

AGTGAAATTG GAATATAGTT CATTCAAAAC GAATCAGGAG AAAGAATTCA ACAAACTTTC 2580 

TGAAAGACAC ATGCATGTAC AGCTTCAATT AGATAATCTC AGGTTAGAAA ACGAAAAGCT 2640 

55 GCTTGAGAGC AAAGCCTGCC TACAGGATTC CTATGACAAC TTACAAGAAA TAATGAAATT 2700 

TGAGATTGAC CAACTTTCAA GAAACCTCCA AAACTTCAAA AAAGAAAATG AAACTCTGAA 2760 

ATCTGATCTG AATAATTTGA TGGAGCTTCT TGAGGCAGAA AAAGAACGCA ATAACAAATT 2820 

ATCATTACAG TTTGAAGAAG ATAAAGAAAA CAGTTCTAAA GAAATCTTAA AAGTTCTTGA 2880 

c GGCTGTACGT CAGGAGAAAC AGAAAGAGAC GGCCAAGTGT GAGCAGCAGA TGGCAAAAGT 2940 

60 ACAGAAACTA GAAGAGAGCT TGCTTGCTAC TGAAAAAGTG ATCAGTTCCC TGGAAAAGTC 3000 

TAGAGATTCT GATAAGAAAG TTGTAGCTGA CCTCATGAAC CAGATCCAGG AGCTAAGAAC 3060 

ATCGGTCTGT GAGAAAACAG AAACTATAGA CACCCTGAAA CAAGAACTGA AGGACATAAA 3120 

TTGCAAATAC AACTCTGCTT TGGTTGACAG AGAAGAGAGC AGAGTGTTGA TCAAGAAGCA 3180 

- GGAAGTGGAT ATTCTGGATC TGAAAGAAAC CCTTAGGCTG AGAATACTTT CTGAGGACAT 3240 

05 AGAGAGGGAT ATGCTCTGTG AGGACCTGGC TCATGCCACT GAGCAGCTGA ACATGCTCAC 3300 

AGAGGCCTCA AAAAAACACT CGGGGCTGCT GCAGTCTGCC CAGGAAGAAC TGACCAAGAA 3360 

GGAAGCCCTG ATTCAGGAAC TTCAGCACAA GCTAAACCAA AAGAAAGAGG AAGTAGAACA 3420 

GAAGAAGAAT GAATATAACT TCAAAATGAG GCAACTAGAA CATGTGATGG ATTCTGCTGC 3480 

TGAGGATCCC CAGAGTCCTA AGACACCACC TCACTTTCAA ACACATTTGG CAAAACTCCT 3540 

70 GGAAACACAA GAACAAGAGA TAGAAGATGG AAGAGCCTCT AAGACTTCTT TGGAACACCT 3600 

TGTAACAAAG CTAAATGAAG ACAGAGAAGT CAAAAATGCT GAAATCCTCA GAATGAAGGA 3660 

GCAGTTGCGT GAAATGGAAA ACCTACGCCT GGAAAGTCAG CAGTTAATAG AGAAAAACTG 3720 

GCTCCTGCAA GGTCAGCTGG ATGATATTAA AAGACAAAAG GAAAACAGTG ATCAGAATCA 3780 

TCCAGATAAT CAACAGCTGA AGAATGAACA AGAAGAAAGT ATCAAAGAAA GACTTGCAAA 3840 

75 AAGTAAAATA GTTGAAGAAA TGCTGAAAAT GAAAGCAGAC CTAGAAGAAG TCCAAAGTGC 3900 

CCTTTACAAC AAAGAGATGG AATGCCTTAG AATGACTGAT GAAGTCGAAC GAACCCAAAC 3960 

TTTGGAGTCT AAAGCATTCC AGGAAAAAGA ACAACTGAGA TCAAAGCTGG AAGAAATGTA 4020 

TGAAGAAAGA GAGAGAACAT CCCAGGAGAT GGAAATGTTA AGGAAGCAGG TGGAGTGTCT 4080 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



TGCTGAGGAA 
AGTGCGACTA 
AAATGTATTT 
GGCATCACCT 
TTAGGAGAGC 
CTCAAGTTTC 
ATTAAGTGGC 
AAATAAAAGC 
TTATGCCATA 
CCTGCAGTGA 
GATTATTTTT 
AAGCCATTTT 



AATGGAAAGT 
AAGAAGGAAA 
TTAAAAGAAA 
TGTTTGAAGA 
TGAATTTATG 
TGATGAACAT 
CTACTTCAAG 
CTGTAGCTAA 
ATCTCCTTTG 
GTTTAATGAC 
GTATTCTTAC 
TTAAGATAAA 



TGGTAGGTCA 
ATGTCAGGCT 
AQAAAAGAAG 
TGTTTCTTCT 
GACCTTAATT 
TCTGCATOCA 
GCTTTGAATC 
GGTTTACAGT 
ATTCTTATGG 
TGACTTAGTA 
TTTAGGTATT 
AAAAAAAAAA 



CCAAAATTTG 
TGCTGAGGAG 
TGAATCTTGA 
CTTTTACAAG 
ATTAAATGTT 
TATACACCCT 
AACTTAAGGG 
GGACATTAGC 
AAGTTCTAAC 
GCAGGTACAA 
TTACTTGAGC 



CATCAGAAGA 
ACAGAAAAGT 
GGATTCCGGT 
TAAGACCTAC 
TATAAGGTGG 
GTGACAGTCA 
AAAACCTTTT 
CAGATCATTT 
AATATATGGT 
GAAGCAAACT 
ATTTTCCATG 



TTCAGTACGT 
TGCGTGCCGA 
CAGCTACCTA 
TCCTGGCCAC 
TGGTAACCAC 
GCAGTCTGCT 
GTCTTTGTAA 
TCTTCTTAGA 
GGTTCCAACA 
TGTTAATATA 
ACTGTAAATA 



Seq ID NO: 164 Protein Sequence 
Protein Accession ft: NP_064627 



1 

! 

MAPGCKTELR 
LHSNPEPKTF 
GPSESDNFSH 
ASAGLYLREH 
TITIESMEKS 
ALVDVGNGKQ 
KLIKNKAWN 
AMLFFKKSEQ 
FLPEEOORLL 
AQTIAKLEKA 
EYEEFKELTR 
TPTKAYQLHS 
TVQEQMSALQ 
VLEKQLQETQ 
HVQLQliDNLR 
NLMELLEAEK 
ESLLATEKVI 
SALVDREESR 
KHSGLIiQSAQ 
SPKTPPHFQT 
MENLRLESQQ 
EEMLKMKADL 
RTSQEMEMLR 



11 
I 

SVTNGQSNQP 
TFDHVADVDT 
NLRGVIPRSF 
IKKGVFWGA 
NE1VNIRTSL 
RHVCYRDSKL 
EDTQGNVSQL 
EKKSLIEKVT 
SELRNEIQTL 
FSEISGMEKS 
KRQLELESEL 
RPVPKLSPEM 
AKLDEEEHKN 
TKNDFLKSEV 
LENEKLLESK 
ERKNKLSLQP 
SSLEKSRDSD 
VLIKKQEVDI 
EELTKKEALI 
HLAKLLETQE 
LIEKNWLLQG 
EEVQSALYNK 
KQVECIAEEN 



21 
I 

SNEGDAIKVF 
TQESVPATVA 
EYLFSLIDRE 
VEQWTSAAE 
LNLVDLAGSE 
TFLLRDSLGG 
QAEVKRLKEQ 
QLEDLTLKKE 
REQIEHHPRV 
DKNQQGFSPK 
QSLQKANLNL 
GSFGSIjYTQN 
LKLQQHVDKL 
HDLRWLHSA 
ACLQDSYDNL 
EEDKENSSKE 
KKWADLMNQ 
LDLKETLRLR 
QELQHKLNQK 
QEIEDGRASK 
QLDDIKRQKE 
EMECLRMTDE 
GKLVGHQNLH 



31 
I 

VRIRPPAERS 
KSIVESCMSG 
KEKAGAGKSF 
AYQVLSGGWR 
RQKDTHAEGM 
NAKTAIIANV 
LAELASGQTP 
KFIQSNKMIV 
AKYAMENHSL 
AQKEPCLFAN 
ENUjEATKAC 
SSI LEND I LN 
EHHSTQMQEL 
DKELSSVKLE 
QEIMKFEIDQ 
ILKVLEAVRQ 
IQELRTSVCE 
ILSEDIERDM 
KEEVEQKKNE 
TSLEHLVTKL 
NSDQNHPDNQ 
VERTQTLESK 
QKIQYWRLK 



41 

I 

GSADGEQNLC 
YNGTIFAYGQ 
LCKCSFIEIY 
NRRVASTSMN 
RLXEAGNINR 
HPGSRCFGET 
PESFLTRDKK 
KFREDQIIRL 
REENRRLRLL 
TEKLKAQLLQ 
KRQEVSQLNK 
EPVPPEMNEQ 
FSSER1DVJTK 
YSSFKTNQEK 
LSRNLQNFKK 
EKQKETAKCE 
KTETIDTLKQ 
LCEDLAHATE 
YNFKMRQLEH 
NEDREVKNAE 
QLKNEQEESI 
AFQEKEQLRS 
KENVRLAEET 



51 
I 

LSVLSSTSLR 
TGSGKTFTMM 
NEQIYDLLDS 
RESSRSHAVF 
SLSCLGQVIT 
LSTLNFAQRA 
KTNYMEYFQB 
EKLHKESRGG 
EPVKRAQEMD 
IQTELNNSKQ 
IHAETLKIIT 
APEAISEELR 
QQEELLSQLN 
EFNKLSERHM 
ENETLKSDLN 
QQMAKVQKLE 
ELKDINCKYN 
QLNMLTEASK 
VMDSAAEDPQ 
ILRMKEQLRE 
KERLAKSKIV 
KLEEMYEERE 
EKLRAENVFL 



Seq ID NO: 165 DNA Sequence 

Nucleic Acid Accession #: CAT cluster 



1 

I 

TTTTTTTTTT 
GATTCCGTGA 
TTAATGCTAT 
GGTCTTAATT 
CTGAGCTCCT 
CATTAGGTAC 
AATTTAATAA 
TGAAAATATG 
AAACTCAGAG 
TACGTGATCC 
TTTCTTTCCC 
AATTAAACTT 
AGTCCTCCCT 
GCCATTTTTG 
GAGAAGATTT 
CCGAATAGTG 



11 
I 

TTTTTTCACA 
GTAGTCTTGA 
TGTATTTCCT 
GAATGGTGGG 
TTTAAAATCA 
AGTATTTTAA 
GGAATAATGA 
AGAACAAATA 
GTAAACTTGT 
TATATTTTTC 
TCTTCCCAAG 
TTGGAGGGTA 
GTAACTGGGT 
TTTCGTCTTT 
TGGTCGGACT 
GCTTTTGGAT 



21 
1 

TGCTGATGTT 
ATAATTTAAA 
AATTCTCAGC 
GTCGAGATGA 
TCAACAAGCG 
ACATACAAAA 
TGTCCTTAAG 
GATTTAAATA 
GAATACATAA 
TGTCCACGTC 
ATTATCCAAG 
ATCGCCAGCT 
TTTCCCGGGA 
TTTGTTTTGT 
AGCGCCATCC 
GAACAATGAA 



31 
I 

GCTGCCGATG 
ATTCGAAAAT 
TTTAACATGT 
CTGCGTCAGA 
AAAATCCTTA 
CCTAGGCATA 
TTTATTTTAA 
GGAACACCCA 
ATCTAAATAA 
TTAAAAGTAT 
GCAGTTCCAA 
GTTTCCAGAA 
ATCCGGGGGG 
TGGTGGTCAG 
AGCGGGTTTA 



41 
I 

ATTTCAACGC 
CAAATTCTAC 
AAGAAAGTAC 
ATTAAATCTC 
TCAATAGCGA 
TTAAAAAGCA 
TCAGCAAGTA 
GTAAACTATO 
GTCAGTTACC 
TTTTTACCTT 
TACGCGTTTT 
AAATACTTTT 
GGCATGTTGT 
GTTTTCTTTT 
GAGAAGGAGC 



51 
I 

CTGGCTTTGA 
TTATTTTCTC 
TTTCGCTAGG 
TGGAAGACCT 
TGTGGGAATG 
CTCCTCTGGT 
TGACTCAATT 
GTATGCAAAT 
ATCAAAATAT 
GGGGCTTTAT 
CAATAATGGT 
AATAGGTGGT 
TTTTATTGTC 
CATAGCGGGG 
ACACTTTTCC 



Seq ID NO: 166 DNA Sequence 

Nucleic Acid Accession ft: NM_006953.1 

Coding sequence: 33.. 896 



1 
I 

CCGTTCCGCG 
TCGGCTGCCT 
TCGCCACCAA 
ACAGCAAAGA 
CCATTTCCAG 
TACAAACAGA 
GCAGTGACCT 



11 
I 

CTCTGGCGGC 
GCGGTTOGGC 
CAACCCCACA 
GGCCCTCACT 
GAATGCCTCA 
GGGTGGGAGG 
GCCCAGCCTG 



21 
I 

TCCTCCCGGG 
TCGGCTGTGA 
CTTACCACTG 
GGCACCCACG 
GTGCAAGACA 
ACAGGTCCCT 
GATGCCATTG 



31 
I 

CGATGCCTCC 
ACCTGCAGCC 
TGGCCTTGGA 
AGGTCTACCT 
GCACCAACAC 
ACAAAGCTGT 
GGGATGTGTC 



41 
I 

GCTCTGGGCC 
CCAACTGGCC 
AAAGCCTCTC 
GTATGTCCTG 
CCCACTGGGC 
GGCCTTTGAC 
CAAGGCCTCA 



51 
I 

CTGCTGGCCC 
AGTGTGACTT 
TGCATGTTTG 
GTCGACTCAG 
TCAACGTTCC 
CTGATCCCCT 
CAGATCCTGA 



4140 
4200 
4260 
4320 
4360 
4440 
4S00 
4560 
4620 
4660 
4740 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 



60 

120 

180 

240 

300 

360 

420 
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ATGCCTACCT GGTCAGGGTG GGTGCCAACG GGACCTGCCT GTGQGATCCC AACTTCCAGG 480 

GCCTCTGTAA CGCACCCCTG TCGGCAGCCA CGGAGTACAG GTTCAAGTAT GTCCTGGTCA 540 

ATATGTCCAC GGGCTTGGTA GAGGACCAGA CCCTGTGGTC GGACCCCATC CGCACCAACC 600 

AGCTCACCCC ATACTCGACG ATCGACACGT GGCCAGGCCG GCGGAGOGGA GGCATGATCG 660 

5 TCATCACTTC CATCCTGGGC TCCCTGCCCT TCTTTCTACT TGTGGGTTTT GCTGGCGCCA 720 

TTGCCCTCAG CCTCGTGGAC ATGGGGAGTT CTGATGGGGA AACGACTCAC GACTCCCAAA 7 BO 

TCACTCAGGA GGCTGTTCCC AAGTCGCTGG GGGCCTCGGA GTCTTCCTAC ACGTCCGTGA 840 

ACCGGGGGCC GCCACTGGAC AGGGCTGAGG TGTATTCCAG CAAGCTCCAA GACTGAGCCC 900 

AGCACCACCC CTGGGCAGCA GCATCCTCCT CTCTGGCCTT GCCCCAGGCC CTGCAGCGGT 960 

10 GGTTGTCACA CCCTGACTTC AGGGAAGGTG AAACAGGGCT TGTCCCTCCA ACTGCAGGAA 1020 
AACCCTTAAT AAAATCTTCT GATGAGTTCT AAAAAAAAA 



15 



Seq ID NO i 167 Protein Sequence 
Protein Accession fl: NP 008884.1 



1 11 21 31 41 51 

1 I I I I I 

MPPLWALLAL. GCLRFGSAVN LQPQLASVTF ATNNPTLTTV ALEKPLCMFD SKEALTGTHE 60 

VYLYVLVDSA ISRNASVQDS TNTPLGSTPL QTBGGRTGPY KAVAFDLIPC SDLPSLDAIG 120 

20 DVSKASQILN AYLVRVGANG TCLWDPNFQG LCNAPLSAAT EYRFKYVLVN MSTGLVEDQT 180 

LWSDPIRTNQ LTPYSTIDTW PGRRSGGMIV ITSILGSLPF FLLVGPAGAI ALSI>VDMGSS 240 
DGETTHDSQI TQEAVPKSLG ASESSYTSVN RGPPLDRAEV YSSKIXJD 

25 Seq ID NO: 168 DNA Sequence 

Nucleic Acid Accession #: NM_005672.l 
Coding sequence : 1 8 . . 3 8 9 

1 11 21 31 41 51 

30 | I I 1 I I 

AGGGAGAGGC AGTGACCATG AAGGCTGTGC TGCTTGCCCT GTTGATGGCA GGCTTGGCCC 60 

TGCAGCCAGG CACTGCCCTG CTGTGCTACT CCTGCAAAGC CCAGGTGAGC AACGAGGACT 120 

GCCTGCAGGT GGAGAACTGC ACCCAGCTGG GGGAGCAGTG CTGGACCGCG OGCATCCGCG 180 

CAGTTGGCCT CCTGACCGTC ATCAGCAAAG GCTGCAGCTT GAACTGCGTG GATGACTCAC 240 

35 AGGACTACTA CGTGGGCAAG AAGAACATCA CGTGCTGTGA CACCGACTTG TGCAACGCCA 300 

GCGGGGCCCA TGCCCTGCAG CCGGCTGCCG CCATCCTTGC GCTGCTCCCT GCACTCGGCC 360 

TGCTGCTCTG GGGACCCGGC CAGCTATAGG CTCTGGGGGG CCCCGCTGCA GCCCACACTG 420 

GGTGTGGTGC CCCAGGCCTT TGTGCCACTC CTCACAGAAC CTGGCCCAGT GGGAGCCTGT 480 

CCTGGTTCCT GAGGCACATC CTAACGCAAG TTTGACCATG TATGTTTGCA CCCCTTTTCC 540 

40 CCNAACCCTG ACCTTCCCAT GGGCCTTTTC CAGGATTCCN ACCNGGCAGA TCAGTTTTAG 600 

TGANACANAT CCGCNTGCAG ATGGCCCCTC CAACCNTTTN TGTTGNTGTT TCCATGGCCC 660 

AGCATTTTCC ACCCTTAACC CTGTGTTCAG GCACTTNTTC CCCCAGGAAG CCTTCCCTGC 720 

CCACCCCATT TATGAATTGA GCCAGGTTTG GTCCGTGGTG TCCCCCGCAC CCAGCAGGGG 780 

ACAGGCAATC AGGAGGGCCC AGTAAAGGCT GAGATGAAGT GGACTGAGTA GAACTGGAGG 840 

45 ACAAGAGTTG ACGTGAGTTC CTGGGAGTTT CCAGAGATGG GGCCTGGAGG CCTGGAGGAA 900 

GGGGCCAGGC CTCACATTTG TGGGGNTCCC GAATGGCAGC CTGAGCACAG CGTAGGCCCT 960 
TAATAAACAC CTGTTGGATA AGCCAAAAAA 

Seq ID NO: 169 Protein Sequence 
50 Protein Accession |t NP_00S663.1 

1 11 21 31 41 51 

I I I I I I 

MKAVLLALLM AGLALQPGTA LLCYSCKAQV SNEDCLQVEN CTQLGEQCWT ARIRAVGLLT 50 
55 VISKGCSLNC VDDSQDYYVG KKNITCCDTD LCNASGAHAL QPAAAILALL PALGLLLWGP 120 
GQL 

Seq ID NO: 170 DNA Sequence 
Nucleic Acid Accession 8: NMJJ06952.1 
60 Coding sequence: 11.. 7 93 

1 11 21 31 41 51 

I I I i i l 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCT GATTT T 60 

65 TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

70 ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

75 CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGQ 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 
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Seq ID NO: 171 Protein Sequence 
Protein Accession fh NP_0088B3.1 

^ 1 11 21 31 41 51 

5 | | I | I ] 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 
IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFPTPNL 120 
FLKQMLBRYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 
DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
10 LCWTFWVLLG TMFYWSRIEY 

Seq ID NO : 172 DMA Sequence 
Nucleic Acid Accession #: Eos sequence 
^ Coding sequence » 1..672 

1 11 21 31 41 51 

111111 

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 

CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120 

20 AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180 

CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 

GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 

TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 

ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 

25 AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGQA GAAGCGGTTT 480 

CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 

TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 

AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
AGCCTGTCTT GA 
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Seq ID NO: 173 Protein Sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

35 | | | | | 1 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60 

LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120 

TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 



Seq ID NO: 174 DNA Sequence 
Nucleic Acid Accession 8: XM_057014 
Coding sequence: 143.. 874 



1 11 21 31 41 51 

I I I I i I 

GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

50 CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCOG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

55 TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 . 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTOGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

. A AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 

60 CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 

65 TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 



Seq ID NO: 175 Protein Sequence 
Protein Accession ft: XP_057014 



1 11 21 31 41 51 

75 | | | | | | 

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCIiQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFBE SWTPNYKQCS WSSLNYGIDL 120 
GKIABCTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180 
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GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 
LPK I 

Seq ID NO: 176 DNA Sequence 
5 Nucleic Acid Accession ft: XM_084007 
Coding sequence: 1..2266 

1 11 21 31 41 51 

in' I 1 1 1 1 

1(J ATGGCGAGGA AGTTATCTGT AATCTTGATC CTGACCTTTG CCCTCTCTGT CACAAATCCC 60 

CTTCATGAAC TAAAAGCAGC TGCTTTCCCC CAGACCACTG AGAAAATTAG TCCGAATTGG 120 

GAATCTGGCA TTAATGTTGA CTTGGCAATT TCCACACGGC AATATCATCT ACAACAGCTT ISO 

TTCTACCGCT ATGGAGAAAA TAATTCTTTG TCAGTTGAAG GGTTCAGAAA ATTACTTCAA 240 

AATATAGGCA TAGATAAGAT TAAAAGAATC CATATACACC ATGACCACGA CCATCACTCA 300 

15 GACCACGAGC ATCACTCAGA CCATGAGCGT CACTCAGACC ATGAGCATCA CTCAGACCAC 360 

GAGCATCACT CTGACCATOA TCATCACTCC CACCATAATC ATGCTGCTTC TGGTAAAAAT 420 

AAGOGAAAAG CTCTTTGCCC AGACCATGAC TCAGATAGTT CAGGTAAAGA TCCTAGAAAC 480 

AGCCAGGGGA AAGGAGCTCA CCGACCAGAA CATGCCAGTG GTAGAAGGAA TGTCAAGGAC 540 

AGTGTTAGTG CTAGTGAAGT GACCTCAACT GTGTACAACA CTGTCTCTGA AGGAACTCAC 600 

20 TTTCTAGAGA CAATAGAGAC TCCAAGACCT GGAAAACTCT TCCCCAAAGA TGTAAGCAGC 660 

TCCACTCCAC CCAGTGTCAC ATCAAAGAGC CGGGTGAGCC GGCTGGCTGG TAGGAAAACA 720 

AATGAATCTG TGAGTGAGCC CCGAAAAGGC TTTATGTATT CCAGAAACAC AAATGAAAAT 780 

CCTCAGGAGT GTTTCAATGC ATCAAAGCTA CTGACATCTC ATGGCATGGG CATCCAGGTT 840 

CCGCTGAATG CAACAGAGTT CAACTATCTC TGTCCAGCCA TCATCAACCA AATTGATGCT 900 

25 AGATCTTGTC TGATTCATAC AAGTGAAAAG AAGGCTGAAA TCCCTCCAAA GACCTATTCA 960 

TTACAAATAG CCTGGGTTGG TGGTTTTATA GCCATTTCCA TCATCAGTTT CCTGTCTCTG 1020 

CTGGGGGTTA TCTTAGTGCC TCTCATGAAT CGGGTGTTTT TCAAATTTCT CCTGAGTTTC 1080 

CTTGTGGCAC TGGCCGTTGG GACTTTGAGT GGTGATGCTT TTTTACACCT TCTTCCACAT 1140 

TCTCATGCAA GTCACCACCA TAGTCATAGC CATGAAGAAC CAGCAATGGA AATGAAAAGA 1200 

30 GOACCACTTT TCAGTCATCT GTCTTCTCAA AACATAGAAG AAAGTGCCTA TTTTGATTCC 12 60 

ACGTGGAAGG GTCTAACAGC TCTAGGAGGC CTGTATTTCA TGTTTCTTGT TGAACATGTC 1320 

CTCACATTGA TCAAACAATT TAAAGATAAG AAGAAAAAGA ATCAGAAGAA ACCTGAAAAT 1380 

GATGATGATG TGGAGATTAA GAAGCAGTTG TCCAAGTATG AATCTCAACT TTCAACAAAT 1440 

GAGGAGAAAG TAGATACAGA TGATCGAACT GAAGGCTATT TACGAGCAGA CTCACAAGAG 1500 

35 CCCTCCCACT TTGATTCTCA GCAGCCTGCA GTCTTGGAAG AAGAAGAGGT CATGATAGCT 1560 

CATGCTCATC CACAGGAAGT CTACAATGAA TATGTACCCA GAGGGTGCAA GAATAAATGC ,1620 

CATTCACATT TCCACGATAC ACTCGGCCAG TCAGACGATC TCATTCACCA CCATCATGAC 1680 

TACCATCATA TTCTCCATCA TCACCACCAC CAAAACCACC ATCCTCACAG TCACAGCCAG 1740 

CGCTACTCTC GGGAGGAGCT GAAAGATGCC GGCGTCGCCA CTTTGGCCTG GATGGTGATA 1800 

40 ATGGGTGATG GCCTGCACAA TTTCAGCGAT GGCCTAGCAA TTGGTGCTGC TTTTACTGAA 1860 

GGCTTATCAA GTGGTTTAAG TACTTCTGTT GCTGTGTTCT GTCATGAGTT GCCTCATGAA 1920 

TTAGGTGACT TTGCTGTTCT ACTAAAGGCT GGCATGACCG TTAAGCAGGC TGTCCTTTAT 1980 

AATGCATTGT CAGCCATGCT GGCGTATCTT GGAATGGCAA CAGGAATTTT CATTGGTCAT 2040 

TATGCTGAAA ATGTTTCTAT GTGGATATTT GCACTTACTG CTGGCTTATT CATGTATGTT 2100 

45 GCTCTGGTTG ATATGGTACC TGAAATGCTG CACAATGATG CTAGTGACCA TGGATGTAGC 2160 

CGCTGGGGGT ATTTCTTTTT ACAGAATGCT GGGATGCTTT TGGGTTTTGG AATTATGTTA 2220 
CTTATTTCCA TATTTGAACA TAAAATCGTG TTTCGTATAA ATTTCTAG 

Seq ID NO: 177 Protein Sequence 
50 Protein Accession #: XP_084007 

1 11 21 31 41 51 

I I I I ! I 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

55 FYRYGENNSL SVEGFRXLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 

EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASBVTST VYNTVSEGTH PLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQ1DA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSb LGVILVPLMN RVFFKFL&SF 360 

60 LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 

65 NAXiSAMIiAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 178 DNA Sequence 
Nucleic Acid Accession fit BC010423 
70 Coding sequence i 248.. 1780 

1 11 21 31 41 51 

I I I I I I 

CACAGCGTGG GAAGCAGCTC TGGGGGAGCT CGGAGCTCCC GATCACGGCT TCTTGGGGGT 60 

75 . AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120 

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180 

TCTGCAGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG AAACGCTGGG CAGTCTGCCT 240 

TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 300 
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QCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCO GGTGAGCTGG AGACCTCAGA 360 

CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCAA GTGGGGCAAG TGGCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 480 

ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGGCTTACG AGGGCCGCGT 540 

5 GGAGCAGCCG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCTGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGGTCCAGC 720 

ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840 

10 CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT CCACTTGGTG CCTAGCCGCA GCATGAATGG 900 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080 

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140 

15 CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

CAATGAG7TC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGOAAGA 1260 

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320 

ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1330 

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

20 CCGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740 

25 CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

30 TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 2290 

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

35 TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 

40 CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 17 9 Protein Sequence 
Protein Accession #: AAH10423 

45 1 11 21 31 41 51 

I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFPAGSPQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 

50 VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 

GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 360 

FCLLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 

55 AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NOt 180 DNA Sequence 
Nucleic Acid Accession #: Eos sequence 
^ Coding sequences 482.. 3007 

1 11 21 31 41 51 

I I I 1 I I 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 

TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 

65 TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180 

ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 

GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 

70 ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 

CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 

AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 

TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 

75 CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 

CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
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CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 

COGATTTGTT CCTTTAACCC ATCTAGATCT TCCTGGAAAT CAATTACAAA CATTGCCTTA 1080 

TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

5 TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 

AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC IS 00 

10 ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

15 ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 

ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

20 GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

25 AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2S20 

GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2S80 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 

30 GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

35 AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 

ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 

GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 3238 

40 Seq ID NO: 181 Protein Sequence 

Protein Accession #i Eos sequence 

1 11 21 31 41 51 

A* I 1 1 1 1 1 

4 J MKLWIHLFYfi SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60 

VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQIiHIN 120 

HNSLEILKED TFHGLENLEF LQADMMFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 180 

RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDWCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 

50 TSILKLPTKA PGIjIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQBR NIESLSDLRP 360 

PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 

TDDILCTSPG HLDKKELKAL NSEII.CPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

55 LTDAVPLSVL ILGLLIMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 

YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 

LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEY1»RKNIA 780 

QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 

60 T 

Seq ID NO: 182 DNA Sequence 
Nucleic Acid Accession #: XM_035292.2 
- Coding sequence: 1..1524 

65 

1 11 21 31 41 51 

I I I I I I 

ATGGCGGGTG CGGGCCCGAA GCGGCGCGCG CTAGCGGCGC CGGCGGCCGA GGAGAAGGAA 60 

GAGGCGCGGG AGAAGATGCT GGCCGCCAAG AGCGCGGACG GCTCGGCGCC GGCAGGCGAG 120 

70 GGCGAGGGCG TGACCCTGCA GCGGAACATC ACGCTGCTCA ACGGCGTGGC CATCATCGTG 180 

GGGACCATTA TCGGCTCGGG CATCTTCGTG ACGCCCACGG GGGTGCTCAA GGAGGCAGGC 240 

TCGCCGGGGC TGGCGCTGGT GGTGTGGGCC GGGTGCGGCG TCTTCTCCAT CGTGGGCGCG 300 

CTCTGCTAOG CGGAGCTCGG CACCACCATC TCCAAATCGG GCGGCGACTA CGCCTACATG 360 

_ c CTGGAGGTCT ACGGCTCGCT GCCCGCCTTC CTCAAGCTCT GGATCGAGCT GCTCATCATC 420 

75 CGGCCTTCAT CGCAGTACAT CGTGGCCCTG GTCTTCGCCA CCTACCTGCT CAAGCCGCTC 480 

TTCCCCACCT GCCCGGTGCC CX3AGGAGGCA GCCAAGCTCG TGGCCTGCCT CTGCGTGCTG 540 

CTGCTCACGG CCGTGAACTG CTACAGCGTG AAGGCCGCCA CCCGGGTCCA GGATGCCTTT 600 

GCCGCCGCCA AGCTCCTGGC CCTGGCCCTG ATCATCCTGC TGGGCTTCGT CCAGATCGGA 660 
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AAGGGTGATG TGTCCAATCT AGATCCCAAC TTCTCATTTG AAGGCACCAA ACTGGATGTG 720 

GGGAACATTG TGCTGGCATT ATACAGCGGC CTCTTTGCCT ATGGAGGATG GAATTACTTG 780 

AATTTCGTCA CAGAGGAAAT GATCAACCCC TACAGAAACC TGCCCCTGGC CATCATCATC 840 

TCCCTGCCCA TCGTGACGCT GGTGTACGTG CTGACCAACC TGGCCTACTT CACCACCCTG 900 

5 TCCACCGAGC AGATGCTGTC GTCCGAGGCC GTGGCCGTGG ACTTCGGGAA CTATCACCTG 960 

GGCGTCATGT CCTGGATCAT CCCCGTCTTC GTGGGCCTGT CCTGCTTCGG CTCCGTCAAT 1020 

GGGTCCCTGT TCACATCCTC CAGGCTCTTC TTCGTGGGGT CCCGGGAAGG CCACCTGCCC 10 BO 

TCCATCCTCT CCATGATCCA CCCACAGCTC CTCACCCCCG TGCCGTCCCT CGTGTTCACG 1140 

TGTGTGATGA CGCTGCTCTA CGCCTTCTCC AAGGACATCT TCTCCGTCAT CAACTTCTTC 1200 

10 AGCTTCTTCA ACTGGCTCTG CGTGGCCCTG GCCATCATCG GCATGATCTG GCTGCGCCAC 12 60 

AGAAAGCCTG AGCTTGAGCG GCCCATCAAG GTGAACCTGG CCCTGCCTGT GTTCTTCATC 1320 

CTGGCCTGCC TCTTCCTGAT CGCCGTCTCC TTCTGGAAGA CACCCGTGGA GTGTGGCATC 1380 

GGCTTCACCA TCATCCTCAG CGGGCTGCCC GTCTACTTCT TCGGGGTCTG GTGGAAAAAC 1440 

AAGCCCAAGT GGCTCCTCCA GGGCATCTTC TCCACGACCG TCCTGTGTCA GAAGCTCATG IS 00 

15 CAGGTGGTCC CCCAGGAGAC ATAG 

Seq ID NO: 183 Protein Sequence 
Protein Accession 8: XP_035292.2 

20 1 11 21 31 41 51 

I ! I I I I 

MAG AGP ICR RA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAI IV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVPSIVGA LCYAELGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

25 LLTAVNCYSV KAATRVQDAF AAAXLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 

GNIVLALYSG LFAYGGWNYI, NFVTEEMINP YRNLPLAI 1 1 SLPIVTLVYV LTNLAYFTTT* 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KD.1FSVINFF SFFNWLCVAL AIIGMIWLRH 420 

RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 

30 KPKWLLQGIF STTVLCQKLM QWPQET 

Seq ID NO: 184 DNA Sequence 
Nucleic Acid Accession #: NM_00526B.l 
^ Coding sequence: 1..822 

1 11 21 31 41 51 

i ! II 1 i 

ATGAACTGGA GTATCTTTGA GGGACTCCTG AOTGGGGTCA ACAAGTACTC CACAGCCTTT 60 

GGGCGCATCT GGCTGTCTCT GGTCTTCATC TTCCGCGTGC TGGTGTACCT GGTGACGGCC 120 

40 GAGCGTGTGT GGAGTGATGA CCACAAGGAC TTCGACTGCA ATACTCGCCA GCCCGGCTGC 180 

TCCAACGTCT GCTTTGATGA GTTCTTCCCT GTGTCCCATG TGCGCCTCTG GGCCCTGCAG 240 

CTTATCCTGG TGACATGCCC CTCACTGCTC GTGGTCATGC ACGTGGCCTA CCGGGAGGTT 300 

CAGGAGAAGA GGCACCGAGA AGCCCATGGG GAGAACAGTG GGCGCCTCTA CCTGAACCCC 360 

GGCAAGAAGC GGGGTGGGCT CTGGTGGACA TATGTCTGCA GCCTAGTGTT CAAGGCGAGC 420 

45 GTGGACATCG CCTTTCTCTA TGTGTTCCAC TCATTCTACC CCAAATATAT CCTCCCTCCT 480 

GTGGTCAAGT GCCACGCAGA TCCATGTCCC AATATAGTGG ACTGCTTCAT CTCCAAGCCC 540 
TCAGAGAAGA ACATTTTCAC CCTCTTCATG GTGGCCACAG CTGCCATCTG CATCCTGCTC ' 600 

AACCTCGTGG AGCTCATCTA CCTGGTGAGC AAGAGATGCC ACGAGTGCCT GGCAGCAAGG 660 

AAAGCTCAAG CCATGTGCAC AGGTCATCAC CCCCACGGTA CCACCTCTTC CTGCAAACAA 720 

50 GACGACCTCC TTTCGGGTGA CCTCATCTTT CTGGGCTCAG ACAGTCATCC TCCTCTCTTA 780 
CCAGACCGCC CCCGAGACCA TGTGAAGAAA ACCATCTTGT GA 

Seq ID NO: 185 Protein Sequence 
^ Protein Accession #: NP_005259.l 

1 11 21 31 41 51 

i 1 I I I I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVPI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 

■ SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 12 0 

60 GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 

DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

£c Seq ID NO: 186 DNA Sequence 
65 Nucleic Acid Accession Jh NM_002391.1 
Coding sequence: 2 5.. 4 57 

1 11 21 31 41 51 

W 1 1 1 1 1 ' 

/U CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

75 TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TQCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 
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CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT 

Seq ID NO: 187 Protein Sequence 
Protein Accession 8: NP_002382.1 

1 11 21 31 41 51 

I I I ! I I 

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGPREGT 60 

CGAQTQRIRC RVPCNWKKEF GADCKYXFEN WGACDGGTGT KVRQGTLKKA RYNAQOQETI 120 
RVTKPCTPKT KAKAKAKKGK GKD 

Seq ID NO: 188 DNA Sequence 
Nucleic Acid Accession 8: NMJ515507 
Coding sequence: 241-1902 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT AOGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 189 Protein Sequence 
Protein Accession #: NP_056322.2 

1 11 21 31 41 51 

I I I I I I 

MPLPWSLALP IiLLSWVAGGP GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 

CEATCEPGCK FGECVGPNKC RCPPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 

LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 

SLRGDVPFPK VNBAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLIiS VDD 
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Seq ID MO j 190 DNA Sequence 
Nucleic Acid Accession th NM_006475 
Coding sequence: 12.. 2522 

5 1 11 21 31 41 51 

I I i I I i 

AGAGACTCAA GATGATTCCC TTTTTACCCA TGTTTTCTCT ACTATTGCTG CTTATTGTTA 60 

ACCCTATAAA CGCCAACAAT CATTATGACA AGATCTTGGC TCATAGTCGT ATCAGGGGTC 120 

GGGACCAAGG CCCAAATGTC TGTGCCCTTC AACAGATTTT GGGCACCAAA AAGAAATACT 180 

10 TCAGCACTTG TAAGAACTGG TATAAAAAGT CCATCTGTGG ACAGAAAACG ACTGTTTTAT 240 

ATGAATGTTG CCCTGGTTAT ATGAGAATGG AAGGAATGAA AGGCTGCCCA GCAGTTTTGC 300 

CCATTGACCA TGTTTATGGC ACTCTGGGCA TCGTGGGAGC CACCACAACG CAGCGCTATT 360 

CTGACGCCTC AAAACTGAGG GAGGAGATCG AGGGAAAGGG ATCCTTCACT TACTTTGCAC 420 

CGAGTAATGA GGCTTGGGAC AACTTGGATT CTGATATCCG TAGAGGTTTG GAGAGCAACG 480 

15 TGAATGTTGA ATTACTGAAT GCTTTACATA GTCACATGAT TAATAAGAGA ATGTTGACCA 540 

AGGACTTAAA AAATGGCATG ATTATTCCTT CAATGTATAA CAATTTGGGG CTTTTCATTA 600 

ACCATTATCC TAATGGGGTT GTCACTGTTA ATTGTGCTCG AATCATCCAT GGGAACCAGA 660 

TTGCAACAAA TGGTGTTGTC CATGTCATTG ACCGTGTGCT TACACAAATT GGTACCTCAA 720 

TTCAAGACTT CATTGAAGCA GAAGATGACC TTTCATCTTT TAGAGCAGCT GCCATCACAT 780 

20 CGGACATATT GGAGGCCCTT GGAAGAGACG GTCACTTCAC ACTCTTTGCT CCCACCAATG 840 

AGGCTTTTGA GAAACTTCCA CGAGGTGTCC TAGAAAGGTT CATGGGAGAC AAAGTGGCTT 900 

CCGAAGCTCT TATGAAGTAC CACATCTTAA ATACTCTCCA GTGTTCTGAG TCTATTATGG 960 

GAGGAGCAGT CTTTGAGACG CTGGAAGGAA ATACAATTGA GATAGGATGT GACGGTGACA 1020 

GTATAACAGT AAATGGAATC AAAATGGTGA ACAAAAAGGA TATTGTGACA AATAATGGTG 1080 

25 TGATCCATTT GATTGATCAG GTCCTAATTC CTGATTCTGC CAAACAAGTT ATTGAGCTGG 1140 

CTGGAAAACA GCAAACCACC TTCACGGATC TTGTGGCCCA ATTAGGCTTG GCATCTGCTC 1200 

TGAGGCCAGA TGGAGAATAC ACTTTGCTGG CACCTGTGAA TAATGCATTT TCTGATGATA 1260 

CTCTCAGCAT GGTTCAGOGC CTCCTTAAAT TAATTCTGCA GAATCACATA TTGAAAGTAA 1320 

AAGTTGGCCT TAATGAGCTT TACAACGGGC AAATACTGGA AACCATCGGA GGCAAACAGC 1380 

30 TCAGAGTCTT CGTATATCGT ACAGCTGTCT GCATTGAAAA TTCATGCATG GAGAAAGGGA 1440 

GTAAGCAAGG GAGAAACGGT GCGATTCACA TATTCCGCGA GATCATCAAG CCAGCAGAGA 1500 

AATCCCTCCA TGAAAAGTTA AAACAAGATA AGCGCTTTAG CACCTTCCTC AGCCTACTTG 1560 

AAGCTGCAGA CTTGAAAGAG CTCCTGACAC AACCTGGAGA CTGGACATTA TTTGTGCCAA 1620 

CCAATGATGC TTTTAAGGGA ATGACTAGTG AAGAAAAAGA AATTCTGATA CGGGACAAAA 1680 

35 ATGCTCTTCA AAA CAT CAT T CTTTATCACC TGACACCAGG AGTTTTCATT GGAAAAGGAT 1740 

TTGAACCTGG TGTTACTAAC ATTTTAAAGA CCACACAAGG AAGCAAAATC TTTCTGAAAG 1B00 

AAGTAAATGA TACACTTCTG GTGAATGAAT TGAAATCAAA AGAATCTGAC ATCATGACAA 1860 

CAAATGGTGT AATTCATGTT OTAGATAAAC TCCTCTATCC AGCAGACACA CCTGTTGGAA 1920 

ATGATCAACT GCTGGAAATA CTTAATAAAT TAATCAAATA CATCCAAATT AAGTTTGTTC 1980 

40 GTGGTAGCAC CTTCAAAGAA ATCCCCGTGA CTGTCTATAC AACTAAAATT ATAACCAAAG 2040 

TTGTGGAACC AAAAATTAAA GTGATTGAAG GCAGTCTTCA GCCTATTATC AAAACTGAAG 2100 

GACCCACACT AACAAAAGTC AAAATTGAAG GTGAACCTGA ATTCAGACTG ATTAAAGAAG 2160 

GTGAAACAAT AACTGAAGTG ATCCATGGAG AGCCAATTAT TAAAAAATAC ACCAAAATCA 2220 

TTGATGGAGT GCCTGTGGAA ATAACTGAAA AAGAGACACG AGAAGAACGA ATCATTACAG 2280 

45 GTCCTGAAAT AAAATACACT AGGATTTCTA CTGGAGGTGG AGAAACAGAA GAAACTCTGA 2340 

AGAAATTGTT ACAAGAAGAG GTCACCAAGG TCACCAAATT CATTGAAGGT GGTGATGGTC 2400 

ATTTATTTGA AGATGAAGAA ATTAAAAGAC TGCTTCAGGG AGACACACCC GTGAGGAAGT 2460 

TGCAAGCCAA CAAAAAAGTT CAAGGTTCTA GAAGACGATT AAGGGAAGGT CGTTCTCAGT 2520 

GAAAATCCAA AAACCAGAAA AAAATGTTTA TACAACCCTA AGTCAATAAC CTGACCTTAG 2580 

50 AAAATTGTGA GAGCCAAGTT GACTTCAGGA ACTGAAACAT CAGCACAAAG AAGCAATCAT 2640 

CAAATAATTC TGAACACAAA TTTAATATTT TTTTTTCTGA ATGAGAAACA TGAGGGAAAT 2700 

TGTGGAGTTA GCCTCCTGTG GTAAAGGAAT TGAAGAAAAT ATAACACCTT ACACCCTTTT 2760 

TCATCTTGAC ATTAAAAGTT CTGGCTAACT TTGGAATCCA TTAGAGAAAA ATCCTTGTCA 2820 

CCAGATTCAT TACAATTCAA ATCGAAGAGT TGTGAACTGT TATCCCATTG AAAAGACCGA 2880 

55 GCCTTGTATG TATGTTATGG ATACATAAAA TGCACGCAAG CCATTATCTC TCCATGGGAA 2940 

GCTAAGTTAT AAAAATAGGT GCTTGGTGTA CAAAACTTTT TATATCAAAA GGCTTTGCAC 3000 

ATTTCTATAT GAGTGGGTTT ACTGGTAAAT TATGTTATTT TTTACAACTA ATTTTGTACT 3060 

CTCAGAATGT TTGTCATATG CTTCTTGCAA TGCATATTTT TTAATCTCAA ACGTTTCAAT 3120 

AAAACCATTT TTCAGATATA AAGAGAATTA CTTCAAATTG AGTAATTCAG AAAAACTCAA 3180 

60 GATTTAAGTT AAAAAGTGGT TTGGACTTGG GAA 



Seq ID NO: 191 Protein Sequence 
- Protein Accession ft: BAA02836.1 

65 

1 11 21 31 41 51 

I I I I I I 

MIPFLPMFSL IaLJjLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKVFSTC 60 

KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

70 KLREEIEGKG SFTYFAPSNE AWDRLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180 

NGMIIPSMYN NLGLFINHYP NGWTVNCAR IIHGNQIATN GWHVIDRVL TQIGTSIQDF 240 

IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERP MGDKVASEAIi 300 

MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 

75 VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 

RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 

FKGMTSEEKB ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600 

TLLVNELKSK ESDIMTTNOV IHWDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660 
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PKEIPVTVYT TKIITKWEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPB FRLIKEGETI 720 

TEVIHGEPII KKYTKIIDGV PVEITEKETR EERI ITGPEI KYTRISTGGG ETEETLKKLL 780 
QBEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

Seq ID NOi 192 DMA Sequence 
Nucleic Acid Accession 8 j NM_006670 
Coding sequence: 85.. 1347 

1 11 21 31 41 SI 

I I I I I I 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300 

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360 

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACOAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGOGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT T ATT CAT AAA 1740 

TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 
ATTCTTAAAA GAA 

Seq ID NO: 193 Protein Sequence 
Protein Accession #: CAA82324.1 

1 11 21 31 41 51 

I I I 1 1 I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA PARRPPLAEL "120 

AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPP AFSGSNASVS APSPLVELIL 180 

NHIVPPEDER QNRSFEGMW AALLAGRALQ GLRRLELASN EFLYI»PRDVIi AQLPSLRHLD 240 

luSNNSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LGGLPHIRVF LDNNPWVCDC 300 

HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360 
IVLALIGAIF LLVLYLNRKG IKKWMHNIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

Seq ID NO: 194 DNA Sequence 
Nucleic Acid Accession 8: NM_014400 
Coding sequence: 86.. 1126 

1 • 11 21 31 41 51 

i I I I I I 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGQATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATOG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAA GGGCTG 600 

CTTCGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 
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TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTQ TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCCOCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960 

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380 

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440 

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500 

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560 

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 
TTGTATAGTG AAAAAAAA 

Seq ID NO: 195 Protein Sequence 
Protein Accession ft: NP_055215 

1 11 21 31 41 51 

I I I I I I 

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 

CTEAVGAVET IHGQPSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120 

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPWS CYNASDHVYK GCFDGNVTLT 180 

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240 

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 

Seq 10 NO: 196 DNA Sequence 
Nucleic Acid Accession #: NM_006536 
Coding sequence: 109. .2940 

1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 4 80 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAQAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGC7G GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 12 60 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 13 BO 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT IB 00 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGQACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCQAGQ TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTOGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 
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AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA GAGATTATCT TATATTGAAA 2820 

GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2940 

ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3000 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 
TACCTAGGAA A 

Seq ID NO: 197 Protein Sequence 
Protein Accession #: NP_006527 

1 11 21 31 41 51 

I I I I I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY KGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKBGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 420 

GDDKLU3NCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 4 B0 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHSIPGSHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780. 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKENGT KLL 



Seq ID NO: 198 DNA Sequence 
Nucleic Acid Accession #: NM_001944 
Coding sequence: 84.. 3083 

1 11 21 31 41 51 

I I I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 1B0 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTG CCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAG CT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 4 80 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA' AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TT CCAAGA CA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 
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TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GQAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1BOO 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG I860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2 ISO 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT QAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GOAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGQAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 25B0 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAAT AG CAT AGCAAAGCTC ACTGTATTGG G CTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CT AAAAT CAT ATTCGC 

Seq ID NO: 199 Protein Sequence 
Protein Accession flt NP_001935 

1 11 21 31 41 51 

I I I I I t 

MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDEEBMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 120 

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGL5TQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGPG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAQ FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 



Seq ID NO: 200 DNA Sequence 
Nucleic Acid Accession 8; NM_020411 
Coding sequence: 86.. 526 

1 11 21 31 41 51 

I I I I I I 

GGACCTGGQA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 
AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 
ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 
GQCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 
ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 
GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 
ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 
ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 
ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 
AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 

Seq ID NO i 201 Protein Sequence 
Protein Accession ft: NP_065144 . 1 

1 ii 21 31 41 51 

icLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNBTAS 60 
PRSPVMESPK KKNOGLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 120 
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SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO i 202 DNA Sequence 
Nucleic Acid Accession th AA172056 
Coding sequence: 121.. 339 

1 11 21 31 41 51 

I I I I ! I 

TTTAGCCACC AGAGGANTTC TCTTGAAATA CCCAAAATCC ATCAGTATCT TGAATCATGC 60 

TGQATTTTGA AGAATTCTTA AGAAGCCATG TAAAGGGGGC TCTCTGGCCT TGAAATAGTG 120 

ATGTTTTTTA TACAGAAAGG AGAATGCAGA ATGGTCAGAC TATCATGCAC TGTTAAATTT 180 

GATTTCAAGA AATTACAGGA AAACTTTCCA AAGTTCCATC TCACAGAANN TTATTTTNCC 240 

AAGAATTCCA AGATAAGTTT AGTTTTATGG AAGACTTTTA TGTGGTTTTT ACTCACTCTT 300 

CATCTCAGAC ATCGACAGAT GATTACATCA CTTATAGTTC TAGTAAATTT ATTAATATAA 360 

AACTCAGAGA CATTCCAATA TCCACATTGC TTACACCATT AGGCATAGAT TCAGTGTCAG 420 

CTATGACAAT TGAAAATGAG CTGTTTTGTG ATTTAAAGGT TTAAATTTCT CTAACCAAAC 480 

TGCTTGATCC AGATGCAGGA CTGCAAATGT TAATATTTGT TCTGGAAGAA CAATCAAATA 540 

AGACTTAAGA GGAAAGGGAA TGGCCACAAT CCACCTGAAA TTTTTTCTTA AAAAGTGTGC 600 

AGCCTACTAA ATCAGAATGA AAATAGAAGT ACAAGATTAT AAACAAAATG CAATCAAACT 660 

TTTCTTAAGC TTACCTAAAG TTATTTCATC TGAAAATTTC AAGCAACTTT GTTCAACATT 720 

AAATTGACAA TCTAAACTAA CAAGTCTTTT GAATTTATGC ATGGTAGTAA ACATTCTCTC 780 

TATTAACTTT ATTACCTAAG GCTAAACCTA AAATTTTTAA GCAAAATTAG AAAAATAGTC 840 

TTCACTCATC AAAAAATAAA GTTTGTTACA TTTAGTATTT TCCCAATAAA ATTGGTCGTT 900 

CTTGGTTTTT TATTTGGAGA GTCTGTGCAA AATGTCACTA AAAATAAATT AGCACTAGAA 960 
ATTATTTCTA AATACCAAA 

Seq ID NO: 203 DNA Sequence 

Nucleic Acid Accession #: NM_0056S6.1 

Coding sequence: 57.. 1535 

1 11 21 31 41 51 

1 I I I ! I 

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620 

ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740 

CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGAOGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
CTGAGTTCAA AGCCATCTT 

Seq ID NO: 204 Protein Sequence 
Protein Accession «: NPJ>05647.1 
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1 11 21 31 41 51 

I I I I I I 

MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 60 

5 SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120 

DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRKSWHP VCQDDWNENY 180 

GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYRKLYHSDA CSSKAWSLR 240 

CLACGVNLNS SRQSRIVGGE SALPOAWPWQ VSUIVQNVHV CGGSIITPEW IVTAAHCVEK 300 

PLNNPWHWTA FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360 

10 VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420 

TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVP 480 
TDWIYRQMKA NG 

Seq ID NO: 205 DNA Sequence 
15 Nucleic Acid Acceseion ft: XM_044533 
Coding sequence: 238.. 2751 

1 11 21 31 41 51 

on I I I I I I 

Z\J GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG OGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

25 CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 42 0 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

30 TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

35 AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 114 0 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

40 CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

45 TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGQGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

50 CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCrr GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

55 TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

60 TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

65 CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2B20 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

70 TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

75 CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 
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CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTQ 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO i 206 Protein Sequence 
Protein Accession #t XP_044533.6 

1 11 21 31 41 51 

I I ( I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RPEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENPTLAR DEKGNVLLED 180 

GKGRCPFDPN PKSTALWDG ELVTGTVSSP QGNDPAISRS QSLRPTKTES SLNWLQDPAP 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGPPPN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHI IEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAHSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLPVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
5 incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 . A method of detecting a bladder cancer-associated transcript in a cell from a patient, 
5 the method comprising contacting a biological sample from the patient with a polynucleotide 

that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1A-13. 

2. The method of Claim I , wherein the biological sample comprises isolated nucleic 
10 acids. 



3 . The method of Claim 2 : 

a) wherein the nucleic acids are mRNA; or 

b) further comprising the step of amplifying nucleic acids before the step of contacting 
1 5 the biological sample with the polynucleotide. 



20 



The method of Claim 1, wherein the polynucleotide: 

a) comprises a sequence as shown in Tables 1A-13; or 

b) is immobilized on a solid surface. 

The method of Claim 1, wherein the patient is: 

a) undergoing a therapeutic regimen to treat bladder cancer; or 

b) suspected of having bladder cancer. 



25 6. An isolated nucleic acid molecule consisting of a polynucleotide sequence as shown 
in Tables 1A-13. 

7. The nucleic acid molecule of Claim 6, which is labeled. 



30 8. An expression vector comprising the nucleic acid of Claim 7. 
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9. A host cell comprising the expression vector of Claim 8. 

10. An isolated polypeptide which is encoded by a nucleic acid molecule having 
5 polynucleotide sequence as shown in Tables 1A-13. 

11. An antibody that specifically binds a polypeptide of Claim 10. 

12. The antibody of Claim 1 1 , further conjugated to an effector component. 

10 

13. The antibody of Claim 12, wherein the effector component is a fluorescent label. 

14. The antibody of Claim 12, wherein the effector component is a radioisotope or a 
cytotoxic chemical 

15 

1 5 . The antibody of Claim 1 1 , which is 

a) an antibody fragment; or 

b) a humanized antibody 

20 16. A method of detecting a bladder cancer cell in a biological sample from a patient, the 
method comprising contacting the biological sample with an antibody of Claim 1 1. 

17. The method of Claim 16, wherein the antibody is further conjugated to an effector 
component. 

25 

1 8. The method of Claim 1 7, wherein the effector component is a fluorescent label. 

1 9. A method for identifying a compound that modulates a bladder cancer-associated 
polypeptide, the method comprising the steps of: 
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a) contacting the compound with a bladder cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1A-13; and 

b) determining the functional effect of the compound upon the polypeptide. . 

5 

20. A drug screening assay comprising the steps of 

a) administering a test compound to a mammal having bladder cancer or a cell isolated 
therefrom; 

b) comparing the level of gene expression of a polynucleotide that selectively hybridizes 
10 to a sequence at least 80% identical to a sequence as shown in Tables 1 A-l 3 in a 

treated cell or mammal with the level of gene expression of the polynucleotide in a 
control cell or mammal, wherein a test compound that modulates the level of 
expression of the polynucleotide is a candidate for the treatment of bladder cancer. 
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